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Flu papers warrant full publication 


Although more debate is needed, the benefits of publishing sensitive data outweigh the risks 


that have so far been made public. 


influenza virus could be misused, and the motivations 

for doing so, but the consequences could be catastrophic. 
There are many scenarios to consider, ranging from mad lone scientists, 
desperate despots and members of millennial doomsday cults to nation 
states wanting mutually assured destruction options, bioterrorists or 
a single person’s random acts of craziness. These are low-probability 
events, but they could introduce a new evolutionary H5N1 seed into 
the environment that seems not to exist in nature. This might not cause 
a pandemic instantly, but it could start the virus on a new path for 
pandemic evolution” 

That is the rationale provided by Paul Keim, acting chair of the 
US National Science Advisory Board for Biosecurity (NSABB), in 
response to questions posed by Nature (P. S. Keim Nature 482, 156-157; 
2012) about the NSABB’s recommendation that recent work on the 
transmissibility in mammals of artificial strains of avian H5N1 influ- 
enza virus should not be published in full. The work was conducted 
in ferrets — generally considered the best animal models for human 
transmission — and shows that avian H5N1 viruses have a greater 
potential to evolve into transmissible forms in mammals, including 
humans, than had been thought. The work is reported in two papers 
accepted but not yet published in Nature and Science. 

Last week, a group of flu and public-health experts gathered at 
the World Health Organization (WHO) headquarters in Geneva, 
Switzerland, to discuss the matter (see go.nature.com/uyrluu). And 
it was clear at the meeting that the above opening quote expresses the 
only rationale that attendees had received. 

To its credit and that of the US government, the NSABB is the only 
body in the world set up to review these issues in a systematic fashion. 
It includes ex-officio representatives of all relevant government depart- 
ments (including intelligence and security agencies), as well as inde- 
pendent researchers. The NSABB’s guidance was an important first step 
in public consideration of the impacts and potential regulation of such 
research. The second step was last week’s meeting at the WHO — again, 
like the NSABB, a body empowered only to make recommendations. 

Some context is important in considering the issues surrounding 
publication. In 2003, Nature and many other journals met to establish 
editorial procedures for considering papers that have public-health 
and scientific benefits but that might also have biosecurity risks (see 
Nature 421, 771; 2003). The statement that emerged from that meet- 
ing envisaged the possibility that a journal would reject a paper if it 
was clear that the risks of publication outweighed the benefits. Nature 
accordingly used independent advisers in considering the submission 
of the latest paper, and most of the advisers recommended publica- 
tion in full. This is also the first paper submitted to any Nature journal 
for which recommendations have been made against publication on 
biosecurity grounds. 

Rather than simply reject the papers, given also the NSABB’s opinion, 


“N: one should presume to know all the ways in which 


both Nature and Science decided to investigate another option: to pub- 
lish a redacted version omitting key methods and data. But a condition 
of such an approach was that a method should exist for distributing a 
full version to those in need of the results for public-health reasons 
and those capable of pursuing the science. Both journals accordingly 

prepared full and redacted versions. 
Those at the WHO meeting, under conditions of strict security, 
examined both versions of the two papers. It had already been said 
in blogs and news coverage that, because the 


“There is methods used are not novel, and because one 
alreadya of the papers had been presented at an open 
substantial meeting, redaction would be pointless. As one 
immediate WHO participant said: “It was only when I'd 


seen both versions that I realized how ineffec- 
tive redaction would be?” What was also con- 
cluded was that a system for distributing the full paper only to selected 
individuals would be impossible to set up on any relevant timescale. 

But what also became clear, partly from unpublished data, was that 
not only does the mammalian transmissibility threat seem greater than 
previously thought, but also that current avian viruses have some of the 
mutations identified in the new work. In other words, there is already a 
substantial immediate risk to humans. The meeting also concluded that 
the new data are of value for surveillance, and that the results should be 
built on to explore the mechanisms underlying transmissibility and the 
high fatality rate observed in humans infected by H5N1. 

Given the inadequacy of redaction, and the immediate risks to global 
public health, the biosecurity objections expressed above seem too gen- 
eral and hypothetical to justify obstructing publication and further 
research. Moreover, with regard to the NSABB’s recommendations 
and the recommendations of the WHO meeting (see go.nature.com/ 
ky2skc), neither of the discussions that preceded them were sufficiently 
inclusive of the security, societal and research interests at stake. 

Therefore, further discussion is essential. That must include a review 
of the safety regimes (lab equipment, buildings and practices) in which 
future work should be conducted. The two laboratories in which the 
latest research originated are categorized as ‘BSL-3 enhanced’ (see 
Nature 480, 421-422; 2011), a classification that, although rigorous in 
these cases, is not well defined in general. The Public Health Agency 
of Canada has deemed the highest level of BSL-4 to be required (see 
page 447). Safety-standards committees in the United States and 
Europe are currently assessing required safety levels, and may report 
within a few weeks. 

As was agreed by the journals and the lead authors at the meeting, 
publication of the papers must wait at least for the outcome of those 
discussions. There may yet be regulatory or legal obstacles to publica- 
tion, or biosecurity or biosafety risks sufficient to outweigh the health 
risks. Otherwise, it is Nature’s view that the papers should ultimately 
be published in full. = 
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Turing at 100 


This year marks the centenary of the birth of 
Alan Turing. He deserves your attention. 


London Olympics kicks off. So it seems apt that, in a special 

issue this week, Nature invites its readers to embrace and 
celebrate a superb marathon runner — who also happened to be one 
of the brightest minds of all time. 

Alan Turing, computer pioneer, wartime code-breaker and poly- 
math, was born in London on 23 June 1912. But for injury, he would 
probably have joined the British Olympic team for the London games 
of 1948. (His personal best marathon time of 2 hours and 46 min- 
utes was barely 11 minutes behind the gold medallist that year.) Yet, 
100 years and one month after his birth, when the Olympics will return 
to the city, no official celebration of the connection is planned. An 
opportunity to bring an intellectual giant — and science itself — to 
the attention of the international public will be missed. 

Turing’s marathon time gives us an objective quantification of 
his physical excellence. His scientific genius and legacy, however, 
are much more difficult to measure — as his biographer, Andrew 
Hodges, a mathematician at the University of Oxford, UK, points out 
on page 441. Still, setting aside quarrels over his role in the develop- 
ment of the computer, the scientific world should stand together and 
relish the wonderful diversity of a universal mind. (See the special 
section starting on page 455 and www.nature.com/turing for more.) 

The scope of Turing’s achievements is extraordinary. Mathematicians 
will honour the man who cracked David Hilbert’s Entscheidungsproblem 
or decision problem, and cryptographers and historians will remember 
him as the man who broke Nazi Germany’s Enigma code and helped 
to shorten the Second World War. Engineers will hail the founder of 
the digital age and artificial intelligence. Biologists will pay homage to 
the theoretician of morphogenesis, and physicists will raise a glass to 
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the pioneer of nonlinear dynamics. Philosophers, meanwhile, are likely 
to continue to frown over his one-liners on the limits of reason and 
intuition: “Ifa machine is expected to be infallible, it cannot also be 
intelligent,” he said in a 1947 talk to the London Mathematical Society. 
Turing demonstrated a terrific ability to combine first-hand experi- 
mentation, keen observation, rigorous theory and practical application. 
His multidisciplinary approach alone makes him ofinterest to this jour- 
nal, yet questions still arise on whether the 


“Turing *smind best papers in pure mathematics, computer 
was truly his science and artificial intelligence should be 
own, and this published in Nature. We certainly think so. 

contributed to So, too, do the researchers invited to decode 
the tragedy of Turing’s legacy in a series of Commentarticles, 


his life. ” starting on page 459. They are thought- 
provoking pieces in their own right, but, more 
importantly, we hope that they will entice readers to seek out Turing’s 
original work (see, for example, B. J. Copeland (ed.) The Essential 
Turing; Clarendon, 2004). His papers are models of accessibility and 
clarity, despite their extreme conceptual depth and intellectual rigour. 
Even his throwaway comments — about symmetry in physics versus 
biology, randomness in intelligence, learning in unorganized machines, 
or emotions in extrasensory perception, for example — are gems. 

Turing’s mind was truly his own, and this contributed to the trag- 
edy of his life. Turing was persecuted by the British authorities for his 
homosexuality, and used cyanide to take his own life, aged 41. 

That 2012 will see numerous events commemorating Turing world- 
wide (see, for example, www.turingcentenary.eu) is almost entirely 
down to volunteers, who have received little or no official help. This is 
in stark contrast to the World Year of Physics in 2005, when the German 
state helped to promote the centenary of Albert Einstein’s ‘miracle year, 
in which he published his four groundbreaking papers. 

What could 2012, the Alan Turing year, be named? Nature suggests 
“The Year of Intelligence’. Of the finest types of intelligence — human, 
artificial and military — Turing is perhaps the only person to have 
made a world-changing contribution to all three. Use this special issue, 
and the rest of 2012, to discover and make up your own mind about 
this extraordinary man. m 


Over the line 


Dishonesty, however tempting, is the wrong 
way to tackle climate sceptics. 


this publication urged researchers to acknowledge that they are 
involved in a street fight over the communication of climate 
science. So would it now be hypocritical to condemn Peter Gleick 
for fighting dirty? Gleick, a hydroclimatologist and president of the 
Pacific Institute for Studies in Development, Environment and Secu- 
rity in Oakland, California, admitted in a statement on news website 
The Huffington Post on 20 February that he had duped the Heart- 
land Institute, a right-wing think tank based in Chicago, Illinois, into 
handing over documents that detailed its financial support for cli- 
mate sceptics. Gleick had passed these documents on to the website 
DeSmogBlog.com, which made them public on 14 February. 
Gleick’s deception — using an e-mail address set up in someone 
else’s name to request the documents from Heartland — is certainly 
in line with some of the tactics used to undermine climate science. 
When in November 2009 a hacker distributed thousands of e-mails 
stolen from climate researchers at the University of East Anglia in 
Norwich, UK, Heartland was prominent among those who criticized 
not the hacker, but the scientists who wrote the messages. However, 
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Gleick, as he has admitted, crossed an important line when he acted 
in such a duplicitous way. It was a foolish action for a scientist, espe- 
cially one who regularly engages with the public and critics. Society 
rightly looks to scientists for fairness and impartiality. Dishonesty, 
whatever its form and motivation, is a stain on the individual and 
the profession. Gleick does deserve credit for coming clean — but, 
it must be said, he did so only after he was publicly accused on the 
Internet of being involved. 

The original accusation, incidentally, was more serious: that Gleick 
had deliberately forged a Heartland Institute memo that brought 
together, with suspicious convenience, the most incriminating sections 
of the other climate documents, which seem to have been presented to 
the Heartland board meeting in January. He denies doing so, and says 
that he received the memo, in which he is named and which Heartland 
says has been faked, separately from an anonymous source. The e-mail 
chicanery, he says, was an attempt to check whether it was genuine. 

In his statement on Monday, Gleick said: “My judgment was blinded 
by my frustration with the ongoing efforts — often anonymous, well- 
funded, and coordinated — to attack climate science and scientists and 
prevent this debate, and by the lack of transparency of the organizations 
involved. Nevertheless I deeply regret my own actions in this case.” 

On 24 January, Gleick had published another article in The Huff- 
ington Post, entitled ‘Climate Change: Sifting 
Truth From Lies in a Complex World’ As he 
now knows, the best way for scientists to help 
people find this truth is through open and 
honest debate. m 
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also for his 1952 conviction for having gay sex (illegal in Brit- 

ain until 1967) and his suicide two years later. Former Prime 
Minister Gordon Brown issued an apology to Turing in 2009, anda 
campaign for a ‘pardon was rebuffed earlier this month. 

Must you be a great figure to merit a ‘pardon’ for being gay? If so, how 
great? Is it enough to break the Enigma ciphers used by Nazi Germany 
in the Second World War? Or do you need to invent the computer as 
well, with artificial intelligence as a bonus? Is that great enough? 

Turing’s reputation has gone from zero to hero, but defining what he 
achieved is not simple. Is it correct to credit Turing with the computer? 
To historians who focus on the engineering of early machines, Turing 
is an also-ran. Today’s scientists know the maxim ‘publish or perish’, 
and Turing just did not publish enough about 
computers. He quickly became perishable goods. 
His major published papers on computability 
(in 1936) and artificial intelligence (in 1950) are 
some of the most cited in the scientific literature, 
but they leave a yawning gap. His extensive com- 
puter plans of 1946, 1947 and 1948 were left as 
unpublished reports. He never put into scientific 
journals the simple claim that he had worked out 
how to turn his 1936 “universal machine” into 
the practical electronic computer of 1945. Turing 
missed those first opportunities to explain the 
theory and strategy of programming, and instead 
got trapped in the technicalities of primitive stor- 
age mechanisms. 

He could have caught up after 1949, had he 
used his time at the University of Manchester, 
UK, to write a definitive account of the theory 
and practice of computing. Instead, he founded a new field in math- 
ematical biology and left other people to record the landscape of com- 
puters. They painted him out of it. The first book on computers to be 
published in Britain, Faster than Thought (Pitman, 1953), offered this 
derisive definition of Turing’s theoretical contribution: 

“Tiiring machine. In 1936 Dr. Turing wrote a paper on the design 
and limitations of computing machines. For this reason they are some- 
times known by his name. The umlaut is an unearned and undesirable 
addition, due, presumably, to an impression that anything so incom- 
prehensible must be Teutonic.” 

That a book on computers should describe the theory of comput- 
ing as incomprehensible neatly illustrates the climate Turing had to 
endure. He did make a brief contribution to the book, buried in chap- 
ter 26, in which he summarized computability 


A lan Turing is always in the news — for his place in science, but 


ANYONE LOOKING 
INTO HIS 


STORY 
AFTER HIS DEATH 
WOULD SEE 
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THAT HE HAD BEEN 
PERSONA NON GRATA. 


The man behind 
the machine 


Alan Turing is famous for many reasons. Andrew Hodges delves into why 
Turing’s achievements took so long to be recognized. 


The 1955 Royal Society’s obituary of Turing, written by 
mathematician Max Newman, did him few favours when it claimed 
that computer designers were unaware of Turing’s 1936 work. The 
Turing machines soon made a comeback, but Turing’s image had 
become that of a pure mathematical logician, unrelated to practi- 
cality. It did not help that anyone looking into his story after his 
death would see dark hints that he had been persona non grata in an 
unmentionable manner — possibly excusable for a remote theorist 
from Cambridge University, but totally inappropriate for the founder 
of a mega-industry. 

Yet the mid-1970s revealed Turing to have been highly practical: the 
chief scientific figure at code-breaking headquarters Bletchley Park, 
and in charge of methods and state-of-the-art machines for beating 
the German navy. Now it was clear why he had 
emerged as a computer builder in 1945 — he 
had gained experience he could never reveal. 
By the 1970s, there was also more room for 
his vision of computation. Software for “every 
known process’, as he foresaw in 1946, was on 
the way. Turing’s vision of mind and machine, 
which drew from his personal consciousness 
and experience, also became more acceptable. 
When in 1977 I started to investigate Turing’s life, 
I found that his code-breaking was the hidden 
bridge between the 1936 theory and the “univer- 
sal practical computing machine” he described 
in his unpublished 1948 work. 

On the question of individual reputation, in 
that 1948 report he wrote: “The isolated man 
does not develop any intellectual power. It is 
necessary for him to be immersed in an envi- 
ronment... He may then perhaps do a little research of his own and 
make a very few discoveries ... the search for new techniques must be 
regarded as carried out by the human community as a whole, rather 
than by individuals.” Science is like that, and he effaced himself in that 
spirit. But he was a star nonetheless. 

What would Turing have thought of the campaign for his ‘pardon’? 
When arrested, he was unrepentant and told police he expected a 
“Royal Commission to legalize it”. Sixty years later, British law has 
caught up, not for him as a special case, but as a matter of princi- 
ple. That practical action speaks louder than symbolic words, and is 
truer to his vision. I see the question not as whether the government 
should have pardoned Turing, but how on Earth Turing could ever 
have pardoned the government. = 


and the universal machine. However, his low- 
key account never conveyed that these central 
concepts were his own, or that he had planned 
the computer revolution. 
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nature.com/turing 


Andrew Hodges is a mathematician at the 
University of Oxford, UK, and author of Alan 
Turing: the Enigma. 
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RESEARCH HIGHLIGHTS 


Claustrophobic 
DNA in tug of war 


When a long thin polymer 
such as DNA is forced into a 
confined space — say a small 
membrane channel — it loses 
some of its freedom, and hence 
its entropy. Regaining that 
entropy is a powerful driving 
force for escape. 

Chia-Fu Chou at the 
Academia Sinica in Taipei 
and his colleagues used an 
electric pulse to force a single 
DNA molecule to extend from 
one microchannel to another 
through a restrictive gap just 
nanometres high. When the 
electric field was turned off, 

a tug-of-war lasting from 
seconds to minutes occurred as 
both ends of the DNA tried to 
pull out of the nanometre-sized 
space. Eventually, one side won 
and the DNA retracted. 

The forces acting on the 
DNA depended only on the 
height of the confined passage 
between channels, and not 
on its length or the length 
of DNA passing through 
it. This understanding 
could aid applications from 
molecular filters to nanopore 
transporters, the authors say. 
Nano Lett. http://dx.doi. 
org/10.1021/nl2045292 (2012) 


Sideways 
activation 


Elucidation of a 
cell receptor’s 
crystal structure 
has revealed 
a unique lateral 
docking mechanism, 
report Hugh Rosen of 
the Scripps Research 
Institute in La Jolla, 
California, and his 
colleagues. 
G-protein-coupled 
receptors (GPCRs) are 


Selections from the 
scientific literature 


EVOLUTION 


Lilliputian lizards come to light 


The forests of northern Madagascar harbour a 
dwarf chameleon that is the smallest lizard in 
the world in terms of total length. Adult males 
of the diminutive Brookesia micra reach 
a length of less than 24 millimetres. 

B. micra and three other tiny lizard species 
were discovered in the region's rainforests and 
dry forests. Miguel Vences at the Technical 


University of Braunschweig in Germany and his 
group analysed tail length and head width, male 
genital morphology and gene sequences to place 
each species within the chameleon taxonomy. 
All occupy a small, discrete geographical range, 
and probably evolved some 10 million to 

20 million years ago, the authors suggest. 

PLoS ONE 7, e31314 (2012) 


signalling molecules that span 
the plasma membranes of cells 
and are generally activated 
by external molecules that 
pass through a channel-like 
opening into a binding site. 
However, the researchers 
determined the crystal 
structure of the sphingosine 
1-phosphate receptor 1 (S1P,, 
pictured) and showed that 
the receptor is triggered by 
certain lipids passing through 
the plasma membrane and 
binding through the lateral 
docking mechanism. 
They also found that 
S1P, has atypical 
binding sites in 
less conserved 
regions of the 
docking site and that 
compounds that adhere 
to these activate S1P, more 
specifically than do lipids. 
Science 335, 851-855 (2012) 
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Immunity’s 
circadian link 


Daily patterns in the body’s 
biochemical and physiological 
processes called circadian 
rhythms may influence 
immune-system function. Erol 
Fikrig and his colleagues at 
Yale University in New Haven, 
Connecticut, have found that 
the expression of an immune 
protein called TLR9 rises and 
falls with the circadian cycle. 

They induced sepsis in mice 
to examine whether pathogen 
recognition — a key part of the 
immune response — varies 
with circadian cycles. Higher 
TLR9 expression at the time of 
sepsis induction was linked to 
a worse outcome for mice. This 
suggests that daily fluctuations 
in biological processes may 


influence vulnerability to 
infections, as well as the 
efficacy of immune therapies 
such as TLR9 agonists, which 
are currently in development. 
Immunity http://dx.doi.org/ 
10.1016/j.immuni.2011.12.017 
(2012) 


Restore my 
beating heart 


Infusions of a patient's own 
cardiac stem cells may reduce 
scar tissue and promote 
heart-muscle growth after a 
heart attack, according toa 
small safety study. Eduardo 
Marban of the Cedars-Sinai 
Heart Institute in Los Angeles, 
California, and his colleagues 
harvested heart cells from 17 
heart-attack patients. The cells 
were used to grow cardiac stem 


J. KOEHLER 


cells that were then reinfused. 
Six months later, patients 
had 28% less scar tissue mass 
than control patients who did 
not receive the infusion. Viable 
heart tissue mass also increased 
following the treatment, 
suggesting partial restoration 
of tissue lost during the heart 
attack. However, patients 
showed no improvement in 
several measurements of heart 
function, such as the volume 
pumped out of the left ventricle 
with each heartbeat. 
Lancet http://dx.doi. 
org/10.1016/S0140- 
6736(12)60195-0 (2012) 


| GENOMICS 
Loss-of-function 
found in droves 


Genome-sequencing work has 
suggested that even healthy 
humans carry hundreds of ‘loss 
of function (LoF) mutations 
that seriously disrupt 
protein-coding genes. Daniel 
MacArthur at the Wellcome 
Trust Sanger Institute 
in Hinxton, UK, and his 
colleagues performed extensive 
analysis on 185 genomes and. 
determined that a typical 
individual carries around 100 
LoF variants, of which about 20 
inactivate both copies of a gene. 
Most of the common 
mutations occurred in non- 
essential genes and didn’t seem 
to affect health. The team 
also identified many rare LoF 
variants found in less than 1% 
of the population, including 47 
serious disease mutations in 
one copy ofa gene. By studying 
differences between the 
harmful and neutral variants, 
the scientists developed 
an algorithm to prioritize 
mutations found in medical 
genome sequencing for further 
investigation. 
Science 335, 823-828 (2012) 


Zombie star 
rising 

When a star suddenly 
brightened in 1961, many 


assumed it had died ina 
supernova — but it seems 


that the light has not yet 
gone out. Schuyler Van Dyk 
at the California Institute of 
Technology in Pasadena and 
Thomas Matheson at the 
National Optical Astronomy 
Observatory in Tucson, 
Arizona, examined ground- 
and space-based observations, 
and say that it still lives. 

The duo reports that the 
star, designated ‘Object 7; 
can be seen on the Hubble 
Space Telescope as a luminous 
blue variable (LBV) star. 
The authors suggest that 
the decades-old outburst 
could represent a ‘supernova 
imposter; a type of explosion 
for which LBVs are known 
that doesn’t destroy the parent 
star. Nevertheless, Object 7 
may be on course to explode, 
and astronomers should look 
out for its stellar death rattle. 
Astrophys. J. 746, 179 (2012) 


IMMUNOLOGY 


Immune system 
master switch 


The fetal immune system 
develops from stem cells in 
the liver, whereas the immune 
cells that protect adults 

form in the bone marrow. 
Moreover, early in life the 
immune system contains cells 
that quickly respond to only 

a limited number of foreign 
molecules; adult immune cells 
can recognize almost anything 
that might harm a host. 

A ‘master-switch’ gene 
called Lin28b accounts for 
these differences, report Stefan 
Muljo and his team at the 
National Institute of Allergy 
and Infectious Diseases in 
Bethesda, Maryland. Lin28b 
— which blocks a class of gene- 
regulating RNA fragments 
called microRNAs — is active 
in the stem cells that form 
a mouse’s immune system 
early in life, yet is absent from 
adult bone marrow. Marrow 
cells engineered to express a 
closely related gene, Lin28, and 
transplanted into adult mice 
form fetal-like immune cells. 

Because the fetal-like 
immune cells are known to 
be effective against some 
pathogens, cancers and other 
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COMMUNITY 


CHOICE 


Simple solution for tricky chemistry 


> HIGHLY READ 
on pubs.acs.org 
In January 


Chemists have invented a reagent to 
5 ease the addition of a desirable chemical 
group to many useful compounds. 


Pharmaceutical, medicinal and agricultural 
chemists add fluorine groups to molecules to improve certain 
properties — to lower toxicity, for example. However, adding 
a difluoromethyl group has proved complicated. 

The process developed by Phil Baran at the Scripps 
Research Institute in La Jolla, California, and his colleagues 
does the job in a simple one-pot reaction. The authors used 
zinc difluoromethylsulphinate salt, a white powder that is 
soluble in water and stable in air, making it easy to handle. 

In water, this produces a reactive difluromethyl radical that 
targets specific sites on other molecules. 

In particular, Baran’s reagent can add a difluoromethyl 
group to nitrogen-containing aromatic ring systems and onto 
some organic molecules containing sulphur. 

J. Am. Chem. Soc. 134, 1494-1497 (2012) 


diseases, coaxing transplanted 
bone marrow cells to take on 
fetal properties could be used 
to improve immune responses. 
Science http://dx.doi.org/ 
10.1126/science.1216557 (2012) 


Antifreeze’s role 
in fish spread 


Antifreeze proteins in the 
bodily fluids of Antarctic 
fishes are a crucial adaptation 
to life in the freezing waters 
— but their appearance alone 
is insufficient to explain the 
huge diversity of the region's 
fish species. Thomas Near of 
Yale University in New Haven, 
Connecticut, and his colleagues 
constructed a phylogeny of 
these notothenioid fishes 
(a sample pictured) and 
correlated it to both the 
appearance of the proteins and 
changes in global climate. 
Contrary to the perception 
that the appearance of 
antifreeze proteins was the 
crucial factor driving evolution, 
they found that the most 
species-rich lineages diversified 
at least 10 million years after 
the proteins’ appearance. This 
bout of evolution happened 
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during a second cooling 

event in the Late Miocene 

(11.6 million to 5.3 million 
years ago), when ice activity in 
the Southern Ocean is thought 
to have increased. The authors 
suggest that the appearance 

of this new polar habitat, 
combined with the pre-existing 
antifreeze proteins, spurred the 
evolution of notothenioids. 
Proc. Natl Acad. Sci. USA 
http://dx.doi.org/10.1073/ 
pnas.1115169109 (2012) 
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SEVEN DAYS sescnisc 


Flu work freed 


Two studies that created 
ferret-transmissible strains 

of the highly pathogenic 

avian H5N1 influenza A 

virus should be published in 
full, a meeting of 22 experts 
convened by the World Health 
Organization in Geneva, 
Switzerland, concluded on 

17 February. Last December, 
the US government and the 
US National Science Advisory 
Board for Biosecurity had 
asked that the research be 
censored. See pages 439 and 
447 for more. 


Climate politics 

A high-profile water and 
climate scientist acknowledged 
on 20 February that he had 
dishonestly acquired internal 
budget documents from 

the Heartland Institute, a 
libertarian think tank in 
Chicago, Illinois, that aims 

to combat climate science. 
Peter Gleick, president of the 
Pacific Institute in Oakland, 
California, released the 
documents to environmental 
website DeSmogBlog. 
Heartland has not disputed 
the authenticity of most of the 
papers, but says that a strategy 
memo — which Gleick says 
he received anonymously — is 
fake. See go.nature.com/v1zrbu 
and page 440 for more. 


Animal testing 


Tens of millions of animals 
have been saved from use 

in chemical safety tests, 

after Europe's chemical 
regulator gave the go-ahead 

to astreamlined method for 
checking substances effects on 
animals’ reproductive systems. 
Toxicologists have been 
concerned that up to 54 million 
animals could be required 

for extra tests mandated 

by the European Union's 
sweeping 2007 chemicals 
legislation — with most of the 


The struggle against soot 


An international coalition has launched a modest 
fund to curb emissions of methane, black carbon 
(soot) and other short-lived climate-affecting 
pollutants (see Nature 481, 245-246; 2012). 

The United States, Canada, Sweden, Mexico, 
Bangladesh and Ghana founded the programme, 
which was unveiled in Washington DC on 16 


increase down to reproductive- 
toxicity tests that have to be 
done in two generations of 
animals. But on 15 February, 
the European Chemical 
Agency, based in Helsinki, 
approved a test that uses only 
one generation. See go.nature. 
com/optzux for more. 


AIDS budget cut 


Health advocates said last 
week that they were dismayed 
by planned cuts to the US 
administration's global AIDS 
programme. According to the 
Kaiser Family Foundation, 
health-policy analysts 
headquartered in Menlo 
Park, California, President 
Barack Obama's 2013 budget 
request would cut 13% 
(US$543 million) from the US 
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state department's support for 
HIV work, although it would 
add 27% ($350 million) to the 
Global Fund to Fight AIDS, 
Tuberculosis and Malaria. 

See go.nature.com/rcdhhv for 
more. 


lran concern 


Iran has responded to 
tightened trade sanctions 

by claiming that it has made 
technical advances in its 
nuclear programme, including 
building a new generation of 
centrifuges to enable faster 
enrichment of uranium. 
Diplomats from the United 
States and Europe dismissed 
the pronouncements on 

15 February as political 
bluster. As Nature went to 
press, inspectors from the 


February. With initial funding of US$15 million, 
it will aim to support projects such as cleaning up 
inefficient biomass stoves, brick kilns (pictured, 
in Kabul), diesel vehicles and coke ovens; and 
reducing gas leakage from rice paddies, landfills, 
wastewater systems and oil and gas extraction. 
See go.nature.com/nu3ak5 for more. 


International Atomic Energy 
Agency in Vienna were visiting 
Tehran to discuss Iran's nuclear 
programme for the second 
time in three weeks. 


Drug trials rap 

On 14 February, three senior 
Democrats in the US House 

of Representatives questioned 
the National Institutes of 
Health and the Food and Drug 
Administration over their 
apparent failure to enforce the 
public reporting of clinical- 
trial results. Under a 2007 

act, sponsors must report the 
results of trials of already- 
approved drugs and devices on 
clinicaltrials.gov within a year 
of completion — or be fined. 
A study (A. P. Prayle et al. 

Br. Med. J. 344, d7373; 2012) 
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published in January found 
that results of only 22% of 738 
trials completed in 2009 were 
reported in time. 


Pe RESEARCH 
Fracking risks 


There is little or no evidence 
that fracking — pumping 
high-pressure fluids into 
shale to force out natural 

gas — has contaminated 
groundwater, according to 

a university-funded report 
(see go.nature.com/sopiwm) 
from researchers assembled 
by the Energy Institute at the 
University of Texas at Austin. 
The report, released on 

16 February at the meeting of 
the American Association for 
the Advancement of Science 
in Vancouver, Canada, found 
that harm ascribed to the 
controversial technique could 
usually be traced to above- 
ground chemical spills or 
problems common to all oil 
and gas drilling operations, 
such as casing failures. 


Dioxin health risk 
The US Environmental 
Protection Agency has released 
along-delayed assessment of 
the health risks of dioxins — 
work that has taken more than 
two decades to produce. In 
line with its 2010 draft report, 
the agency recommends a 

safe consumption limit for the 
chemicals that is well below 
that proposed by the World 
Health Organization. But it 


A study of the careers of nearly 
3,000 tenure-track science and 
engineering assistant professors 
in 14 US universities suggests 


that men and women are retained 


and promoted at about the same 
rate, spending a median time of 
10.9 years at their first university. 
But in mathematics, women 
leave significantly sooner than 
men (see chart). A problem lies 
in hiring: only 27% of incoming 
academics are women, the 
authors point out. See go.nature. 
com/nn23z1 for more. 


also says that current exposure 
to dioxins “does not pose a 
significant health risk” See 
go.nature.com/dh3ary for 
more. 


Greek robbery 


Greece's economic suffering 
has been compounded 

by desecration of its 
archaeological heritage, with 
the robbery of 77 artefacts 
from the Museum of the 
History of the Olympic Games 
in Olympia on 17 February. 
Culture minister Pavlos 
Geroulanos offered to resign 
after the theft, which ministry 
officials said included a 
3,300-year-old gold ring and a 
2,400-year-old oil jar. 


Nanopore sequencer 
Oxford Nanopore 
Technologies, a UK firm 

that promises its technology 
could theoretically sequence a 
human genome in 15 minutes, 
impressed scientists with the 
first public presentation of its 
data on 17 February, at the 
Advances in Genome Biology 
and Technology meeting in 
Marco Island, Florida. The 
technology identifies bases 

in real time by measuring 
electrical conductivity as a 
DNA strand is fed through 

a biological nanopore. The 
company expects to start selling 
its machine in the second half 


RETAINING SCIENCE TALENT 


of this year, and plans to sell 
a miniaturized, disposable 
sequencer (pictured) for less 
than US$900. See go.nature. 
com/evpcle for more. 


ee PE) PEE eee 
Nobel laureate dies 


Virologist Renato Dulbecco, 
who shared the 1975 Nobel 
Prize in Physiology or 
Medicine, died on 19 February, 
aged 97. Dulbecco won the 
Nobel for work in the 1950s 
and ’60s showing that some 
viruses insert their genes into 
the genomes of the cells they 
infect, and that these changes 
can trigger cancer. Bornin 
Italy, Dulbecco also worked in 
the United States and Britain. 
From 1988 to 1992 he was 
president of the Salk Institute 
for Biological Studies in 

San Diego, California. 


MIT head resigns 


The first female president of 
the Massachusetts Institute 
of Technology (MIT) in 
Cambridge has announced 
that she will resign the 

post, after seven years in 
charge. Susan Hockfield, a 


Men and women hired by US science and engineering faculties are 
retained at about the same rate — except in mathematics. 
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SEVEN DAYS | THIS WEEK 


26-29 FEBRUARY 

In Washington DC, 
scientists and policy- 
makers discuss research 
and political efforts 

on biodefence and 
bioterrorism, including 
work that created 
mutant influenza. 
go.nature.com/xnstjz 


27 FEB-2 MARCH 
Graphene, solar- 
energy technology and 
nanoscience remain 
hot topics at this year’s 
American Physical 
Society meeting in 
Boston, Massachusetts. 
go.nature.com/6m7ekb 


neuroscientist, has headed 
MIT since 2004; she previously 
spent two decades at Yale 
University in New Haven, 
Connecticut, including time 

as provost. On 16 February 

she said that she would step 
down when a successor was 
appointed, to pave the way for 
a new fund-raising effort. 


Italy research head 
Italy’s multidisciplinary 
National Research Council 
(CNR), which runs more than 
100 institutes and research 
centres, finally has a new 
president. It should now be 
able to implement a 2009 

law intended to make the 
country’s research system more 
transparent and meritocratic 
(see Nature 476, 386; 2011). 
The reform has dragged 

on because outgoing CNR 
president Francesco Profumo, 
appointed last August, 
declined to resign the post after 
becoming national research 
minister last November. On 

18 February, after Profumo was 
finally pressed into resignation, 
Luigi Nicolais, a chemical 
engineer who is also a member 
of parliament, was appointed 
as president of the CNR. 


> NATURE.COM 
For daily news updates see: 
Www.nlature.com/news 
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Flu meeting opts for openness 


Controversial virus studies should be published and oversight of such work 


strengthened, conference concludes. 


Declan Butler 


21 February 2012 


After weeks of debate, two controversial papers describing forms of the H5N1 avian influenza 
virus capable of transmitting between mammals should be published in full. That was the 
unexpected outcome of a meeting convened last week in Geneva, Switzerland, by the World 
Health Organization (WHO), which also promised to create a more rigorous oversight system 


for such research. 


The decision goes against a recommendation from the US National Science Advisory Board 
for Biosecurity (NSABB), which the US government has adopted as its official position. In 
December 2011, the board said that experimental details of the two studies should be 


redacted from any publications, because of concerns that the information could be used ina 


bioterror attack. The board also feared that publishing the details would prompt more 


laboratories to work on the viruses, making an accidental release more likely. 


The studies, which created forms of H5N1 that can spread between ferrets through airborne 
transmission, are likely to be published in a few months. The 22 experts at the meeting, mainly 
flu researchers, believe that the delay is needed to explain the benefits of the work to the 
public, and allay concerns about its safety. Meanwhile, a 60-day moratorium on similar 
research will be extended until a system is put in place to review levels of biosafety and 
biosecurity. To that end, the WHO intends to convene international discussions among 


regulators and other bodies in the next few months. 


The two researchers at the centre of the controversy say that they are pleased with the 
outcome. “I was pleasantly surprised by the fact that there were unanimous decisions about 
most issues, and strong consensus on the others,” says Ron Fouchier, a flu virologist at 
Erasmus Medical Center in Rotterdam, the Netherlands, whose study has been accepted by 
the journal Science. Yoshihiro Kawaoka of the University of Wisconsin-Madison, lead 
researcher on the other study, adds that the meeting allowed him and Fouchier to explain their 
work, including the potential benefits for surveillance of emerging flu strains (Nature 481, 
417-418; 2012) and for vaccine preparation (Nature 482, 142—143; 2012). “We presented why 
we did these experiments, what we did, what data we obtained, what these data contribute to 
public health and to the scientific field, and why we think the results should be shared,” says 
Kawaoka, whose paper has been accepted by Nature. He adds that data he and Fouchier 
presented on the evolution of H5N1 in the wild clarified the threat from the virus, although he 


would not be drawn on the details, citing confidentiality. 


Microbiologist Paul Keim, who chairs the NSABB and attended the meeting, did not respond to 
Nature’s request for an interview, but is reportedly “disappointed” by the recommendation to 


publish the papers. 


Nature and Science last year agreed in principle to redact the papers, on the condition that the 
US government would develop a mechanism to disseminate the full papers to researchers and 
health officials on a need-to-know basis. But meeting participants concluded that this was 
impractical, and that the potential public-health benefits of the work outweighed any risk of 


publishing the papers in full. 


Biosafety first 


Many flu researchers have already seen the papers, so there was little to be gained by 
restricting their dissemination, says Richard Ebright, a molecular biologist and biodefence 
expert at Rutgers University in Piscataway, New Jersey. It is much more urgent, he says, to 


put in place strict biosafety, biosecurity and oversight provisions for such research. 


David Fidler, an expert in international and national security law at Indiana University in 
Bloomington, points out that the meeting hasn’t actually broken the publication deadlock, 
because Keim and representatives of the US government still do not agree with publishing the 
studies in full. “Most of the meeting’s participants appear to have rejected the US position,” 
says Fidler, “but [have] agreed to the extended moratorium and publication delay in the hope 


that the US government will change its mind.” 


Participants agreed that the mutant viruses should remain in their two containment facilities — 
rated at ‘BSL-3 enhanced’, the second-highest level of biosafety — and that both should be 
reviewed before any work restarts. Didier Houssin, president of the French Evaluation Agency 
for Research and Higher Education, says that the biosafety review of the work must consider 
whether studies of this kind should be conducted only in labs with the highest biosafety rating 
of BSL-4, a restriction imposed this month by Canada. Houssin, who attended the meeting, 
notes that imposing such a restriction globally would curtail similar work because there are just 
a few dozen BSL-4 labs worldwide. The safety level of BSL-3 labs is very variable, he says, 


and so any facilities working on such viruses would need to be rigorously assessed. 


Fidler and other experts note that the meeting did not address the overall risks and benefits of 
the work, or how similar research might be overseen in future. Keiji Fukuda, WHO Assistant 
Director-General for Health Security and Environment, explains that later meetings will deal 


with these topics and will have wider participation. 


Meanwhile, the meeting agreed that it was “critical” for the WHO to form a communications 
plan over the next few months to increase public awareness and understanding of the 
importance of the flu work, and to alleviate public anxieties. But Peter Sandman, a 
risk-communications consultant in Princeton, New Jersey, advises against any attempt by the 
WHO to “educate” the public out of its concerns. As a strategy, he says, it “is thoroughly 


discredited, because it doesn’t work”. 
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US government would develop a mechanism 
to disseminate the full papers to researchers 
and health officials on a need-to-know basis. 
But meeting participants concluded that this 
was impractical, and that the potential public- 
health benefits of the work outweighed any 
risk of publishing the papers in full. 


BIOSAFETY FIRST 

Many flu researchers have already seen the 
papers, so there was little to be gained by 
restricting their dissemination, says Richard 
Ebright, a molecular biologist and biodefence 
expert at Rutgers University in Piscataway, 
New Jersey. It is much more urgent, he says, 
to put in place strict biosafety, biosecurity and 
oversight provisions for such research. 

David Fidler, an expert in international and 
national security law at Indiana University 
in Bloomington, points out that the meet- 
ing hasn’t actually broken the publication 
deadlock, because Keim and representatives 
of the US government still do not agree with 
publishing the studies in full. “Most of the 
meeting's participants appear to have rejected 
the US position,’ says Fidler, “but [have] agreed 
to the extended moratorium and publication 
delay in the hope that the US government will 
change its mind.” 

Participants agreed that the mutant viruses 
should remain in their two containment facili- 
ties — rated at ‘BSL-3 enhanced; the second- 
highest level of biosafety — and that both should 
be reviewed before any work restarts. Didier 
Houssin, president of the French Evaluation 
Agency for Research and Higher Education, 
says that the biosafety review of the work must 
consider whether studies of this kind should 
be conducted only in labs with the highest 
biosafety rating of BSL-4, a restriction imposed 
this month by Canada. Houssin, who attended 
the meeting, notes that imposing such a restric- 
tion globally would curtail similar work because 
there are just a few dozen BSL-4 labs worldwide. 
The safety level of BSL-3 labs is very variable, 
he says, and so any facilities working on such 
viruses would need to be rigorously assessed. 

Fidler and other experts note that the 
meeting did not address the overall risks and 
benefits of the work, or how similar research 
might be overseen in future. Keiji Fukuda, 
WHO Assistant Director-General for Health 
Security and Environment, explains that later 
meetings will deal with these topics and will 
have wider participation. 

Meanwhile, the meeting agreed that it was 
“critical” for the WHO to form a communica- 
tions plan over the next few months to increase 
public awareness and understanding of the 
importance of the flu work, and to alleviate 
public anxieties. But Peter Sandman, a risk- 
communications consultant in Princeton, New 
Jersey, advises against any attempt by the WHO 
to “educate” the public out ofits concerns. Asa 
strategy, he says, it “is thoroughly discredited, 
because it doesn’t work”. m SEE EDITORIAL P.439 
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EPIDEMIOLOGY 


Growing pains for 
children’s study 


Door-to-door recruitment abandoned for US project. 


BY MEREDITH WADMAN 


proposed 15% budget cut is making 
A a troubled adolescence at the 

National Children’s Study (NCS), an 
ambitious US government project that aims 
to chart biological, environmental and social 
influences on the health of 100,000 American 
children from before birth to age 21 years. 

The study’s managers at the National 
Institute of Child Health and Human Devel- 
opment (NICHD) in Bethesda, Maryland, 
say that they can cope with the White House's 
budget proposal, released on 13 February. 
This would cut funding for the programme 
by US$28 million, to $165 million in 2013 
(see ‘Belt tightening’). But their plan to save 
money, by recruiting study participants 
through health-care providers rather than by 
door-to-door recruitment, is worrying some 
of the study’s scientists, who already feel shut 
out from its planning. 

In 2010, a year after it started, the NCS’s 
pilot phase had to expand from seven sites 
to 37 after recruitment rates fell well short 
of expectations. As the pilot winds down 
recruitment this year, it has enrolled only 
4,000 subjects. The study, which could be 
used to probe the roots of conditions such as 
asthma, autism and diabetes, must therefore 
accelerate recruitment sharply after its main 
arm launches in 2013. 

NICHD director Alan Guttmacher says 
that there was “understandable angst” among 
study-site directors the day the budget was 
made public. But NCS managers see room 


BELT TIGHTENING 


Facing a proposed 15% cut, the US National 
Children’s Study is seeking cheaper ways to 
recruit its cohort of more than 100,000 children. 
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for savings, estimating that $30 million 
was spent on recruitment in 2011 alone. 
Although door-to-door recruitment is con- 
sidered a gold standard in epidemiology, 
study managers believe that subjects can 
be recruited much more cheaply through 
health-care-providers’ offices, where pilot- 
study data show that recruiters are much 
more likely to find eligible women who are 
pregnant or trying to become pregnant. The 
household recruitment has another down- 
side, says Guttmacher: “Tt would take so long 
it would compromise the study.’ A “scien- 
tifically compelling” study with a budget of 
$165 million is still possible, he says. 

According to one of the study’s principal 
investigators (PIs), however, money is already 
too tight. “The idea that there are cost savings 
to be made here is absolutely absurd,’ says the 
researcher, who contends that many PIs have 
yet to receive funding for their data-manage- 
ment systems that was promised by NICHD 
managers last October. Some are coping by 
diverting funds from other parts of the study; 
others have simply stopped entering data for 
study subjects. The study's managers say that 
the PIs have been adequately funded. 

Some PIs are also worried that recruitment 
at health-care-providers’ offices would bias 
the study and render its findings inapplicable 
to the wider population. They point to a 2008 
Institute of Medicine report that called the 
household-based sampling approach one of 
the study’s main strengths. 

And some scientists complain that they 
had no input into the decision to change the 
recruiting strategy, which many failed to hear 
about even after the budget was announced. 
“We dont have any full, thorough discussion 
of this” says Nigel Paneth, a paediatrician at 
Michigan State University in East Lansing who 
is PI at the NCS site in Wayne County. “What 
this study needs is full scientific input, not 
Bureaucratic Planning Central.” Guttmacher 
notes that government officials cannot talk 
about White House budget proposals before 
they are released. 

But with many congressional districts 
hosting study centres, the programme has 
proved resilient. The administration of 
former president George W. Bush repeat- 
edly tried to cancel it, but Congress always 
restored full funding. m 


SOURCE: NIH 
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Bioethicist Glenn McGee’s new job raised questions of conflict of interest at the journal he founded. 


Editor ’5 move 
sparks backlash 


Bioethicists are forced to consider their purpose as leading 
practitioner joins controversial stem-cell company. 


BY DAVID CYRANOSKI 


he field of bioethics is embroiled in a 

| period of soul-searching, sparked by 

a startling career move by one of its 
biggest names. 

Glenn McGee is the editor-in-chief of the 
American Journal of Bioethics (AJOB), the most 
cited bioethics journal, which he founded in 
1999. Since December 2011, he has also been 
president for ethics and strategic initiatives 
at CellTex Therapeutics in Houston, Texas, a 
controversial company involved in providing 
customers with unproven stem-cell therapies. 
A CellTex press release says that “Dr McGee's 
responsibilities will include ensuring that all of 
the firm’s work, centered on adult stem cells, 
will meet the highest ethical standards of the 
medical and scientific communities.” 

Although McGee has said he will leave 
the journal on 1 March, many bioethicists 
have criticized him, the journal’s editorial 
board and its publisher, London-based Tay- 
lor and Francis. They argue that in holding 
both posts, McGee has a conflict of interest 
between his responsibilities to the journal 
and his new employer's desire to promote the 


clinical application of stem-cell treatments 
that are not approved by the US Food and 
Drug Administration. 

“Imagine if the Editor of the New England 
Journal of Medicine took a job as Vice Presi- 
dent at Merck, and the Mass Medical Society 
asked him to stay on as Editor, opining that 
the conflicts of interest would be manageable. 
One might rightly wonder, ‘What are these 
people smoking?;” says John Lantos, director 
of the Children’s Mercy Bioethics Center in 
Kansas City, Missouri, and a past president 
of the American Society for Bioethics and 
Humanities. 

More broadly, bioethicists are questioning 
whether it can ever be acceptable to work for 
companies, which, they argue, may be using the 
appointment to present a veneer of ethical pro- 
bity. The episode brings to a head concerns that 
have emerged among bioethicists over the past 
decade, says Insoo Hyun, a stem-cell bioethicist 
at Case Western Reserve University in Cleve- 
land, Ohio. “It’s a perfect storm, he says. 

McGee is a leading voice on one side of the 
debate, arguing that bioethics must have prac- 
tical relevance. For the past three years he has 
been chair of bioethics at the non-profit Center 
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for Practical Bioethics in Kansas City, where he 
ran a course for those who might go on to chair 
hospital ethics committees or serve as ethical 
advisers to corporations. 

But during McGee's tenure as editor-in-chief 
of the AJOB, four editors are known to have 
resigned from the editorial board because 
of differences in opinion over how the jour- 
nal handles conflicts of interest. Two left this 
month, including Lantos, who wrote on his 
blog that he will no longer work with the jour- 
nal because of McGee's simultaneous employ- 
ment at theAJOB and CellTex, and frustration 
over the lack ofa clear conflict-of-interest pol- 
icy at the AJOB. In response to Nature’s ques- 
tions about the situation, Taylor and Francis 
responded that it “is grateful for Dr McGee's 
editorship of AJOB” and “supportive of Glenn’s 
decision to step down” 

On 17 February, McGee announced that he 
is merely acting in an advisory capacity at the 
journal until 1 March, when its new editors- 
in-chief take over. They are David Magnus, 
director of the Center for Biomedical Ethics at 
Stanford University, California, and Summer 
Johnson McGee, director of graduate studies 
at the Center for Practical Bioethics and the 
journal's current executive editor. She is also 
Glenn McGee's wife. 

Responding to questions from Nature, Sum- 
mer Johnson McGee says that the journal has 
a conflict-of-interest policy that requires edi- 
tors to withdraw from reviewing a manuscript 
if they perceive a conflict. She calls allegations 
that her appointment results from her relation- 
ship with her husband “baseless and sexist”. 
“David Magnus and I were hired by our pub- 
lisher, not by my husband.” Magnus says that 
at least a dozen editorial board members have 
supported his and Summer Johnson McGee's 
appointments. Two even indicated that Glenn 
McGee should have been able to retain an 
advisory or editorial role. 

Other bioethicists’ blogs and Twitter feeds 
about the episode have expressed concerns, 

however. Leigh 


“Mainstream Turner of the Uni- 
bioethics is versity of Minnesota, 
no longer Minneapolis, called 
speaking truth onthe entire editorial 
to power. e board of the AJOB to 


resign for allowing 
the situation to persist. And many say that 
McGee's move illustrates a broader problem. 
“Mainstream bioethics is no longer speaking 
truth to power,’ complains Jan Helge Solbakk 
at the University of Oslo. “Instead it has become 
the handmaiden of the medico-industrial com- 
plex, and of bioscience and technology.” 

So how should companies get their advice 
on bioethics? Magnus never takes cash from 
industry for advising or speaking — “ma 
hardass about that” — but he believes that 
bioethicists can work for industry as long as 
they give up their academic positions, includ- 
ing posts on journal editorial boards. > 


23 FEBRUARY 2012 | VOL 482 | NATURE | 449 


© 2012 Macmillan Publishers Limited. All rights reserved 


| NEWS | IN FOCUS 


> Working for a respected company 
may be acceptable to some bioethicists, 
but McGee’s new employer comes with a 
great deal of baggage. CellTex, which was 
founded last year and as yet has no website, 
licenses stem-cell technology from Seoul- 
based RNL Bio. The South Korean com- 
pany has made a business out of taking fat 
cells from people, processing them in a way 
that they say increases the number of mes- 
enchymal stem cells, and then reinjecting 
them in an effort to treat conditions such 
as spinal cord injury. 

McGee already had a connection with 
RNL Bio. In 2010, two patients died fol- 
lowing injections of RNL’s cells. McGee, 
working for stem-cell lobby group the 
International Cellular Medicine Society, 
based in Salem, Oregon, helped to conduct 
an investigation into the company. This 
concluded that only one of the two cases 
was likely to be related to the injections, and 
because the patient understood the risk the 
company was not culpable. 

Jin Han Hong, the then president of 
RNLs US subsidiary, admitted in 2010 that 
there was no clinical-trial evidence proving 
that these treatments are effective (Nature 
468, 485; 2010). As treatment with RNLs 
stem cells is not approved in the United 
States or South Korea, for the procedures 
the company sends patients to China or 
Japan, where regulations are less strictly 
enforced. Using RNL'’s methods, CellTex 
is banking stem cells that have gone on to 
be used in a number of patients, including 
Rick Perry, governor of Texas (Nature 477, 
377-378; 2011). CellTex says that it does 
not conduct medical procedures itself. 

When Nature contacted McGee to put 
the criticisms to him, he directed us to pre- 
vious statements indicating that he wants 
to put CellTex on firmer ethical ground by 
having it conduct clinical trials that meet 
standards set by the International Society 
for Stem Cell Research, based in Deerfield, 
Illinois, which represents most mainstream 
stem-cell researchers around the world. 

Hyun warns that working directly for 
business can be fraught with danger, how- 
ever good a bioethicist’s intentions. In 
2005, he helped to craft the informed con- 
sent procedure for egg donations used in a 
cloning procedure by disgraced Korean 
stem-cell scientist Woo Suk Hwang. Follow- 
ing Hwang’s claim, later proved fraudulent, 
that he had cloned human embryos and har- 
vested stem cells from them, it emerged that 
he had ignored the consent procedure for 
egg donations (Nature 438, 536-537; 2005), 
leading to embarrassment for Hyun. 

“T know first hand how difficult it is to 
separate conflict of interest — to maintain 
the role of bioethicist,” says Hyun. “I know 
you need to not be too chummy with enter- 
prises trying to speed ahead in stem cells.” = 
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Researchers who, like Vadim Backman, top $1.5 million in NIH grants will face an extra layer of review. 


Extra scrutiny for 
‘grandee grantees’ 


An analysis by Nature reveals who holds the most grants 
from the US National Institutes of Health. 


BY ERIC HAND 


adim Backman no longer relies on cof- 
\ fee to get him through the 100-hour 
weeks he puts in at his biomedical 
engineering laboratory at Northwestern Uni- 
versity in Evanston, Illinois. Since giving up 
caffeine, he drops to the floor and does press- 
ups whenever he needs to clear his head. It 
certainly takes an alert mind to supervise 
20 students, collaborate on clinical trials at 
8 hospitals worldwide, and manage 7 grants 
worth a total of more than US$3 million from 
the US National Institutes of Health (NIH) in 
Bethesda, Maryland. 

At 38, Backman is already a biomedical 
superstar. He is developing an imaging tech- 
nology that could detect abnormal structures 
in cells during the earliest stages of cancer. 
And a Nature analysis has identified him as 
one of seven scientists whom the NIH sup- 
ports with the most grants (see ‘Seven lucky 
sever’). That puts him near the top ofa larger 
group of NIH-supported researchers who will 
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soon be targeted for extra scrutiny beyond the 
peer-review process. 

As it released its 2013 budget proposal last 
week, the agency said that researchers who 
control more than $1.5 million in grants will 
undergo an extra layer of review from exter- 
nal advisers before further grants are approved. 
The decision comes as the agency tries to 
scrape money together for new grants in order 
to raise its current grant success rate of 18%, a 
historic low. But the countermeasure — poten- 
tially penalizing applicants on the basis of their 
previous success — is also historic. 

The basic rule for giving out grants at the 
NIH has always been simple: to fund the best 
science. A retreat from pure meritocracy is 
“shocking”, says Howard Garrison, director 
of public affairs at the Federation of Ameri- 
can Societies for Experimental Biology in 
Bethesda, Maryland. “It’s a huge sea change.” 

Nevertheless, Garrison supports the new 
rule because he is concerned about the vast 
number of researchers who are struggling to 
win, or hold, just one grant. 


S. RYAN/NORTHWESTERN UNIV. 


Nearly 1,500 principal investigators (PIs) 
— about 5% of those who held grants in 2011 
— come in above the $1.5-million threshold 
and would be subject to the review. A $750,000 
threshold for a similar layer of extra review has 
been in place since the 1990s at one NIH insti- 
tute, the National Institute of General Medical 
Sciences, and has worked well, says the insti- 
tute’s former director Jeremy Berg, now at the 
University of Pittsburgh in Pennsylvania. 


SHOPPING AROUND 

Sally Rockey, the NIH deputy director for 
extramural research, says the agency isn’t con- 
sidering a hard cap based on the number of 
grants per scientist, nor extra review for those 
with many grants. She points out that a cap 
based on numbers of grants would have to be 
draconian to spread grants to a significantly 
greater number of researchers. An analysis she 
presented on her blog in October 2011 found 
that setting a maximum of two grants per 
PI would increase the grant success rate by 
just 2%. 

In 2008, two NIH advisory panels tasked 
with reforming the peer-review process for 
grants recommended that PIs spend at least 
20% of their time on any given grant — a 
de facto cap of five grants per researcher. 
Although most of the recommendations were 
ultimately adopted, the 20% rule was not. Berg, 
who was on one of the advisory panels, says 
he would still support a review threshold — 
although not a hard cap — for a certain num- 
ber of grants. “You look at people with more 
than a certain number of grants and ask, ‘Is 
this a good investment for the NIH?” he says. 
There are concerns, he adds, that PIs could 
gain multiple grants by presenting similar 
experiments to different NIH institutes. 

Berg has tried to measure the output of labo- 
ratories of different sizes, and found that the 
richest are not necessarily the most produc- 
tive (see Nature 468, 356-357; 2010). “There 
are some people who are definitely capable of 
running bigger operations while maintaining 
tremendous productivity per dollar,” he says. 
“There are other people who are very well 
funded and aren't so productive.” 
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Those questions are especially important for 
the very top grant winners, whom Nature iden- 
tified on the basis of ‘research project grants, 
an NIH-defined category composed mostly of 
the R-01 grants that provide bread-and-butter 
support to most PIs. 

John Tainer, a structural biologist at the 
Scripps Research Institute in La Jolla, Califor- 
nia, feels that the new rule will only further 
entrench a bias against those with multiple 
grants, and worries that it could restrict inno- 
vation by the elite. With 7 grants worth a com- 
bined sum of more than $5 million, it is hard 
to feel sorry for Tainer. But earlier this month, 
he lost a competitive renewal for a grant that 
he has held since 1985, to study the hair-like 
pili on the surface of bacteria that make them 
sticky and contribute to their pathogenicity. 
Because he relies on grants to pay the salaries 
of 18 lab members, as well as his own, this 
rejection could mean lay-offs. 

Tainer suspects that the decision “reflects 
the fact that I have other projects” But, he con- 

tinues, “The science 


“You look at hasn't changed. What 
people with more we're doing now is 
than a certain Destes Wign ige 
juniers f we've ever done. 

The loss of the 
grants and ask, grant will extend 
Pai canes beyond his own lab, 


he adds. “For the next 
decade, people will 
be publishing parts of things that I had done 
better. The cost to the NIH will be higher. If 
you're a leader and you have momentum and 
technology, the impact of taking that away and 
having other people do it at a different level is 
destructive.” 

Backman also dislikes the idea of capping 
the number of grants that an individual can 
win, but is more relaxed about the proposed 
$1.5-million threshold review. He is sympa- 
thetic to the plight of young researchers casting 
about for their first grant — he was in the same 
position just a few years ago — but says that the 
competition for established researchers must 
be based purely on the strength of their ideas. 
“T like the idea of meritocracy,’ he says. m 


SEVEN LUCKY SEVEN 
Seven NIH-supported researchers are principal investigators on seven research project grants each. 
Name Grant total Institution Research 
Ronald Davis $6,986,908 Stanford University Genomics 
John Tainer $5,069,800 Scripps Research Institute Structural biology 
Anjana Rao $3,512,571 La Jolla Institute for Allergy Signalling and gene expression 
& Immunology 
George Koob $3,365,229 Scripps Research Institute Neurobiology of addiction 
Vadim Backman $3,054,165 orthwestern University Biophotonics 
Pier Pandolfi $2,929,857 Beth Israel Deaconess Tumorigenesis 
Medical Center 
Pietro Sanna $2,114,278 Scripps Research Institute Neurobiology of addiction 
Fiscal 2011 grant data were used. Grant totals reflect fractional shares of multi-Pl grants. Analysis excludes grants made to 


large research centres. Grant supplements are included as part of original grant, rather than as a separate award. 
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FERMILAB 


HIGH-ENERGY PHYSICS 
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Physicists raid Tevatron for parts 


Fermilab icon plundered amid tight budgets and shifting scientific aims. 


BY EUGENIE SAMUEL REICH 


stories high, chock full of particle detectors, 

power supplies, electronics and photo- 
multiplier tubes, all layered like a giant onion 
around a cylindrical magnet. During 26 years 
of operation at the Fermi National Accelerator 
Laboratory in Batavia, Illinois, this behemoth, 
the Collider Detector at Fermilab (CDF), 
helped to find the top quark and chased the 
Higgs boson. But since the lab’s flagship parti- 
cle collider, the Tevatron, was switched off in 
September 2011, the detector has been surplus 
stock — and it is now slowly being cannibal- 
ized for parts. 

When the Tevatron closed, Fermilab 
announced that the CDF would become an 
educational display. Along with its companion 
experiment, DO, the detector was supposed to 
form the centrepiece of a tour through simu- 
lated control rooms and decommissioned 
accelerator tunnels. But tight budgets for 
experimental particle physicists — combined 
with their tendency to tinker and recycle — are 
pushing the outcome in a different direction, 
at least for the CDE 

“Some parts are worth pennies, but in this 
budgetary climate, even pennies are worth sav- 
ing,” says Rob Roser, who until recently was 
co-spokesman for the CDF and has now him- 
self been recycled into a new position as head 
of scientific computing at Fermilab. 

“Recycling equipment is as old as science 
itself” says Jonathan Lewis, the Fermilab 
scientist in charge of decommissioning the 


E is a 4,000-tonne edifice that stands three 


The CDF, one of the Tevatron’s two detectors, is 
slowly surrendering its parts to other experiments. 


CANNIBALIZING THE TEVATRON 


Parts at the Tevatron and its two main experiments, the CDF and DO, are being 
considered for recycling in the wake of the collider’s closure in September 2011. 


Main injector 
and recycler 


Beams repurposed 
for neutrino and 
other experiments 


Tevatron 


500 m 


CDE And thrift is in fashion. With most of the 
action in particle physics taking place at the 
Large Hadron Collider near Geneva in Swit- 
zerland, US researchers were preparing for hard 
times even before US President Barack Obama 
released his 2013 budget request on 13 Febru- 
ary (see Nature 482, 283-285; 2012). Although 
the Office of Science at the Department of 
Energy (DOE) got a 2.4% funding boost, the 
budget cut Fermilab’s allotment by 5% and 
DOE funding for high-energy physics by 1.8%. 

Looking for savings, Bogdan Wojtsek- 
howski, who leads three experiments to probe 
nuclear structure with an electron beam at 
the DOE’s Jefferson Lab in Newport News, 
Virginia, convinced Fermilab to send him 600 
photomultiplier tubes, which capture the light 
emitted as particles streak through detector 
materials. Buying them new would have cost 
$600,000. 

The recycling reflects not only parsimony, but 
also a programmatic shift in US particle physics. 
With the shutdown of the Tevatron, researchers 
moved from the energy frontier, where physics 
is tested with particle collisions at the highest 
energies, to the intensity frontier, where the 
highest numbers of particles are collided. The 
fate of the CDF’s parts mirrors this switch. 

Some power supplies are going to Mu2e, an 
intensity-frontier experiment at Fermilab to 


Antiproton 


source 
Booster 
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* Central magnet for a 
possible particle-decay 
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DO 


look for the rare conversion of muons to elec- 
trons. Amplifier chips are going to g-2, another 
intensity-frontier experiment at Fermilab 
to measure a key magnetic parameter of the 
muon. Some scintillation materials, which emit 
light along the path of charged particles pass- 
ing through them, are destined for the Long- 
Baseline Neutrino Experiment, which plans to 
send neutrinos 1,300 kilometres from Fermilab 
to the Homestake Mine in Lead, South Dakota. 

Although most of the donations involve 
small items that would not stop the CDF from 
going on display, the most ambitious recycling 
request so far would see it gutted. A proposed 
experiment called ORKA, which would search 
for a predicted, but as yet unobserved, rare 
decay of kaon particles, needs a massive sole- 
noid magnet like the one at the CDF’s heart. 
ORKA has yet to be funded by the DOE, but 
the Physics Advisory Committee at Fermilab 
approved its scientific goals in December 2011. 

Robert Tschirhart, co-spokesman for 
ORKA, says that adapting the magnet from the 
CDF by replacing some of its present detectors 
with a kaon detector may cost several million 
dollars. That would still be about half the cost 
of buying a new magnet. 

Lab management will make a decision about 
whether ORKA can eviscerate the CDF in 
about 6 months’ time. m 
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PALAEOBOTANY 


Wild flower blooms again 
after 30,000 years on ice 


Fruits hoarded by ancient ground squirrels give new life to prehistoric plants. 


BY SHARON LEVY 


uring the Ice Age, Earth’s northern 
D reaches were covered by chilly, arid 

grasslands roamed by mammoths, 
woolly rhinoceros and long-horned bison. That 
ecosystem, known by palaeontologists as the 
mammoth steppe, vanished about 13,000 years 
ago. It has no modern counterpart. 

Yet one of its plants has reportedly been res- 
urrected by a team of scientists who tapped a 
treasure trove of fruits and seeds, buried some 
30,000 years ago by ground squirrels and pre- 
served in the permafrost (S. Yashina et al. Proc. 
Natl Acad. Sci. USA http://dx.doi.org/10.1073/ 
pnas.1118386109; 2012). The plant would be 
by far the most ancient ever revived; the previ- 
ous record holder was a date palm grown from 
seeds roughly 2,000 years old. 

The squirrels’ burrows, 70 in all, were found 
on the banks of the lower Kolyma River in 
northeastern Siberia, 20-40 metres below the 
current surface of the tundra and surrounded 
by the bones of mammoths and other crea- 
tures. Some burrows contained hundreds of 
thousands of fruits and seeds, wonderfully 
preserved by the cold, dry environment. 

Researchers had previously attempted to 
grow plants from seeds found in these ancient 
burrows, including sedge, Arctic dock, alpine 
bearberry and the herbaceous plant Silene 
stenophylla. Those seeds did begin to germi- 
nate, but then faltered and died back. 

Tantalized, David Gilichinsky of the Russian 
Academy of Sciences’ Institute of Physicochem- 
ical and Biological Problems in Soil Science in 
Pushchino decided to try a different approach 
(sadly, Gilichinsky passed away last week). He 
and his colleagues took samples of placental 
tissue from S. stenophylla fruits. The plant pla- 
centa — an example of which is the white matter 


A prehistoric plant resurrected from frozen tissue. 


inside a bell pepper — gives rise to and holds 
the seeds. The tissue produced shoots when it 
was cultivated in vitro, and the scientists used 
these to propagate more plants. They are the 
oldest living multicellular organisms on Earth, 
the team says. 

The plants have already blossomed to pro- 
duce fertile seeds, which were grown into a 


second generation of fertile plants. During 
propagation, the ancient form of the wild 
flower produced more buds but was slower 
to put out roots than modern S. stenophylla, 
which is found along the banks of the Kolyma. 
This suggests that the original has a distinct 
phenotype, adapted to the extreme environ- 
ment of the Ice Age. 

“Tm excited that someone has finally suc- 
ceeded in doing this,” says Grant Zazula of 
the Yukon Palaeontology Program in White- 
horse, Canada, who has investigated previous 
claims of ancient seed germination. “There is 
a good chance that extinct plant species could 
now be brought back to life from permafrost- 
preserved seeds,” 

Although some members of the mam- 
moth steppe ecosystem survive, no place on 
Earth currently holds the same combination 
of grasses, sedge and wild flowers that have 
been found in the mummified guts of Ice Age 
mammoths or in the frozen hoards of squirrels 
(B. V. Gaglioti et al. Quatern. Res. 76, 373-382; 
2011). Zazula speculates that living plant tissue 
from much earlier — hundreds of thousands 
of years ago — might also be revived, revealing 
evolutionary change over a longer timescale, 
and helping scientists to understand the lost 
ecology of periods such as the Ice Age. m 


CLARIFICATION 

The table in the News story ‘Obama shoots 
for science increase’ (Nature 482, 283-285; 
2012) was unclear about the make-up of 
the Food and Drug Administration’s budget. 
Obama’s request leaves the government’s 
input nearly flat, but a rise in user fees from 
industry would lift the agency’s overall 2013 
budget to $4,486 million. 
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LEGACY OF AUNIVERSAL MIND 


23 June 1912 — Alan Mathison 

Turing seemed destined to solitude, misun- 
derstanding and persecution (see page 441). As 
his centenary year opens, Nature hails him as one 
of the top scientific minds of all time (see page 
440). This special issue sweeps through Turing’s 
innumerable achievements, taking us from his 
most famous roles — wartime code-breaker and 
founder of computer science (see page 459) — to 
his lesser known interests of botany, neural nets, 
unorganized machines, quantum physics and, 
well, ghosts (see page 562). 

Everyone sees a different Turing. A molecu- 
lar biologist might surprise you by saying that 
Turing’s most important paper is his 1936 work 
on the “Turing machine’ because of its rel- 
evance to DNA-based cellular operations (see 
page 461). A biophysicist could instead point 
to his 1952 work on the formation of biological 
patterns — the first simulation of nonlinear 

dynamics ever to be pub- 


Fe: the day he was born — 


DNATURECOM lished (see page 464). 

For more on Beneath it all, Turing 
Turing, see: was driven by the dream of 
nature.com/turing reviving — possibly in the 


BY TANGUY CHOUARD 


form of a computer program — 
the soul of Christopher Morcom, 
perhaps his only true friend, who died abruptly 
when they were both teenagers. I want to “build 
a brain’, he said. So does electrophysiologist 
Henry Markram (see page 456). But it is still a 
matter of debate whether machine intelligence 
should faithfully simulate neuronal circuitry, 
or just emulate brain function using whatever 
expedient (see page 462). 

Even when Turing was kept busy by wartime 
code-breaking and the practical implementa- 
tion of his universal computer, he never forgot 
that he had, in 1936, discovered something even 
bigger: the ‘incomputable’ world. Contempo- 
rary physics hasnt even started to work out the 
implications of that discovery (see page 465). 

It is typical of Turing’s brilliance and play- 
fulness that even as he gave so many fields the 
tools that allowed them to blossom, he planted 
a concept that pushes science as we know it 
— physical reality and Newtonian causality— 
towards the abyss. = 


Tanguy Chouard, a biology editor at Nature, 
was the consulting editor for this special issue. 


ANDY POTTS; TURING FAMILY 
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——— — ——— = t wasn't quite the lynching that Henry Markram 
» A ; | ' had expected. But the barrage of sceptical com- 
| | | | ments from his fellow neuroscientists — “It’s 
| | | | crap,’ said one — definitely made the day feel 
> | | \ - like a tribunal. 
4 Officially, the Swiss Academy of Sciences 
meeting in Bern on 20 January was an overview 
of large-scale computer modelling in neuro- 
science. Unofficially, it was neuroscientists’ first 
real chance to get answers about Markram’s controversial proposal for the 
Human Brain Project (HBP) — an effort to build a supercomputer simu- 
lation that integrates everything known about the human brain, from the 
structures of ion channels in neural cell membranes up to mechanisms 
behind conscious decision-making. 

Markram, a South-African-born brain electrophysiologist who joined 
the Swiss Federal Institute of Technology in Lausanne (EPFL) a decade 
ago, may soon see his ambition fulfilled. The project is one of six finalists 
vying to win €1 billion (US$1.3 billion) as one of the European Union's 
two new decade-long Flagship initiatives. 

“Brain researchers are generating 60,000 papers per year,’ said 


Henry Markram wants €1 billion Markram as he explained the concept in Bern. “They're all beauti- 
to mo d el th e entire hum an brain ful, fantastic studies — but all focused on their one little corner: this 


molecule, this brain region, this function, this map” The HBP would 

S ceptic Ss don E think he should get it. integrate these discoveries, he said, and create models to explore how 

neural circuits are organized, and how they give rise to behaviour and 

BY M. MITCHELL WALDROP cognition — among the deepest mysteries in neuroscience. Ultimately, 

said Markram, the HBP would even help researchers to grapple with 

disorders such as Alzheimer’s disease. “If we don't have an integrated 
~~ TURING AT 100 view, we wont understand these diseases,’ he declared. 

As the response at the meeting made clear, however, there is deep 

unease about Markram’s vision. Many neuroscientists think it is ill- 

conceived, not least because Markram’s idiosyncratic approach to brain 


» A legacy that spans science: 


nature.com/turing 
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simulation strikes them as grotesquely cumbersome and over-detailed. 
They see the HBP as overhyped, thanks to breathless media reports 
about what it will accomplish. And they're not at all sure that they can 
trust Markram to run a project that is truly open to other ideas. 

“We need variance in neuroscience,’ declared Rodney Douglas, 
co-director of the Institute for Neuroinformatics (INI), a joint initiative 
of the University of Zurich and the Swiss Federal Institute of Technology 
in Zurich (ETH Zurich). Given how little is known about the brain, he 
said, “we need as many different people express- 
ing as many different ideas as possible” — a 
diversity that would be threatened if so much 
scarce neuroscience research money were to be 
diverted into a single endeavour. 

Markram was undeterred. Right now, 
he argued, neuroscientists have no plan for 
achieving a comprehensive understanding of 
the brain. “So this is the plan,” he said. “Build 
unifying models.” 


MARKRAW’S BIG IDEA 

Markram has been on a quest for unity since 
at least 1980, when he began undergraduate 
studies at the University of Cape Town in South Africa. He abandoned 
his first field of study, psychiatry, when he decided that it was mainly 
about putting people into diagnostic pigeonholes and medicating them 
accordingly. “This was never going to tell us how the brain worked,” he 
recalled in Bern. 

His search for a new direction led Markram to the laboratory of 
Douglas, then a young neuroscientist at Cape Town. Markram was 
enthralled. “I said, “That's it! For the rest of my life, ’'m going to dig into 
the brain and understand how it works, down to the smallest detail we 
can possibly find?” 

That enthusiasm carried Markram to a PhD at the Weizmann Institute 
of Science in Rehovot, Israel; to postdoctoral stints at the US National 
Institutes of Health in Bethesda, Maryland, and at the Max Planck Insti- 
tute for Medical Research in Heidelberg, Germany; and, in 1995, toa 
faculty position at Weizmann. He earned a formidable reputation as an 
experimenter, notably demonstrating spike-timing-dependent plasticity 
— in which the strength of neural connections changes according to when 
impulses arrive and leave (H. Markram et al. Science 275, 213-215; 1997). 

By the mid-1990s, individual discoveries were leaving him dissatisfied. 
“Trealized I could be doing this for the next 25, 30 years of my career, and 
it was still not going to help me understand how the brain works, he said. 

To do better, he reasoned, neuroscientists would have to pool their 
discoveries systematically. Every experiment at least tacitly involves a 
model, whether it is the molecular structure of an ion channel or the 
dynamics ofa cortical circuit. With computers, Markram realized, you 
could encode all of those models explicitly and get them to work together. 
That would help researchers to find the gaps and contradictions in their 
knowledge and identify the experiments needed to resolve them. 

Markram’ insight wasn’t original: scientists have been devising math- 
ematical models of neural activity since the early twentieth century, and 
using computers for the task since the 1950s (see page 462). But his ambi- 
tion was vast. Instead of modelling each neuron as, say, a point-like node 
ina larger neural network, he proposed to model them in all their multi- 
branching detail — down to their myriad ion channels (see ‘Building 
a brain’). And instead of modelling just the neural circuits involved in, 
say, the sense of smell, he wanted to model everything, “from the genetic 
level, the molecular level, the neurons and synapses, how microcircuits are 
formed, macrocircuits, mesocircuits, brain areas — until we get to under- 
stand how to link these levels, all the way up to behaviour and cognition”. 

The computer power required to run such a grand unified theory 
of the brain would be roughly an exaflop, or 10° operations per sec- 
ond — hopeless in the 1990s. But Markram was undaunted: available 
computer power doubles roughly every 18 months, which meant that 
exascale computers could be available by the 2020s (see ‘Far to go). 


“IT WILL BE LOTS OF 

EINSTEINS COMING 

TOGETHER 10 BUILD 
A BRAIN.” 
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And in the meantime, he argued, neuroscientists ought to be getting 
ready for them. 

Markram’s ambitions fit perfectly with those of Patrick Aebischer, a 
neuroscientist who became president of the EPFL in 2000 and wanted to 
make the university a powerhouse in both computation and biomedical 
research. Markram was one of his first recruits, in 2002. “Henry gave 
us an excuse to buy a Blue Gene,’ says Aebischer, referring to a then- 
new IBM supercomputer optimized for large-scale simulations. One 
was installed at the EPFL in 2005, allowing 
Markram to launch the Blue Brain Project: his 
first experiment in integrative neuroscience 
and, in retrospect, a prototype for the HBP. 

Part of the project has been a demonstra- 
tion of what a unifying model might mean, 
says Markram, who started with a data set 
on the rat cortex that he and his students 
had been accumulating since the 1990s. It 
included results from some 20,000 experi- 
ments in many labs, he says — “data on about 
every cell type that we had come across, the 
morphology, the reconstruction in three 
dimensions, the electrical properties, the 
synaptic communication, where the synapses are located, the way the 
synapses behave, even genetic data about what genes are expressed”. 

By the end of 2005, his team had integrated all the relevant portions 
of this data set into a single-neuron model. By 2008, the researchers had 
linked about 10,000 such models into a simulation of a tube-shaped 
piece of cortex known as a cortical column. Now, using a more advanced 
version of Blue Gene, they have simulated 100 interconnected columns. 

The effort has yielded some discoveries, says Markram, such as the 
as-yet unpublished statistical distribution of synapses in a column. But its 
real achievement has been to prove that unifying models can, as promised, 
serve as repositories for data on cortical structure and function. Indeed, 
most of the team’s efforts have gone into creating “the huge ecosystem of 
infrastructure and software” required to make Blue Brain useful to every 
neuroscientist, says Markram. This includes automatic tools for turning 
data into simulations, and informatics tools such as http://channelpedia. 
net — a user-editable website that automatically collates structural data 


The Blue Brain simulation — a prototype for the Human Brain Project — 
constructs simulated sections of cortex from the bottom up, starting from 
detailed models of individual neurons. 
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The Blue Brain Project has steadily increased the scale of its cortical simulations 
through the use of cutting-edge supercomputers and ever-increasing memory 
resources. But the full-scale simulation called for in the proposed Human Brain 
Project (red) would require resources roughly 100,000 times larger still. 
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onion channels from publications in the PubMed database, and currently 
incorporates some 180,000 abstracts. 

The ultimate goal was always to integrate data across the entire brain, 
says Markram. The opportunity to approach that scale finally arose in 
December 2009, when the European Union announced that it was pre- 
pared to pour some €1 billion into each of two high-risk, but potentially 
transformational, Flagship projects. Markram, who had been part of the 
27-member advisory group that endorsed the initiative, lost no time in 
organizing his own entry. And in May 2011, the HBP was named as one 
of six candidates that would receive seed money and prepare a full-scale 
proposal, due in May 2012. 

If the HBP is selected, one of the key goals will be to make it highly 
collaborative and Internet-accessible, open to researchers from around 
the world, says Markram, adding that the project consortium already 
comprises some 150 principal investigators and 70 institutions in 
22 countries. “It will be lots of Einsteins coming together to build a 
brain,’ he says, each bringing his or her own ideas and expertise. 


BOTTOM TO TOP 

The description of the HBP as an open user facility sparked interest and 
enthusiasm at the Bern meeting. But much more vocal were Markram's 
critics, many of whom focused on the perceived inadequacies of the 
Blue Brain model — and of Markram’s approach to data integration. 

At the heart of that approach is Markram’s conviction that a good 
unifying model has to assimilate data from the bottom up. In his view, 
modellers should start at the most basic level — he focuses on ion chan- 
nels because they determine when a neuron fires — and get everything 
working at one level before proceeding to the next. This requires a lot 
of educated guesses, but Markram argues that the admittedly huge gaps 
in knowledge about the brain can be filled with data as experiments are 
published — the Blue Brain model is updated once a week. The alterna- 
tive approach, approximating and abstracting away the biological detail, 
leaves no way to be sure that the model's behaviour has anything to do 
with how the brain works, said Markram. 

This is where other computational neuroscientists gnash their teeth. 
Most of them are already using simple models of individual neurons 
to explore high-level functions such as pattern recognition. Markram’s 
bottom-up approach risks missing the wood for the trees, many of them 
argued in Bern: the model could be so detailed that it is no easier to 
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understand than the real brain. And that is if Markram can build it at all. 
Judging by what Blue Brain has accomplished in the past six years, critics 
said, that seems unlikely. The tiny swathe of simulated rat cortex has no 
inputs from sensory organs or outputs to other parts of the brain, and 
produces almost no interesting behaviour, pointed out Kevan Martin, 
co-director of the INI, in an e-mail. It is “certainly not the case” that 
Markram has simulated the column as it works in a whole animal, he said. 

Markram’s response to such criticisms in Bern was that more capa- 
bilities are always being added to the Blue Brain simulation. But Martin 
remained unimpressed. “I cannot imagine how this level of detail, which 
is still very incomplete even after Henry's considerable labours, is ever 
going to be obtained from more than a few regions of the rodent brain 
in the next decade, let alone brains of Drosophila, zebrafish, songbird, 
mouse or monkey,’ his e-mail continued. 

“Of course,’ Martin added, “all this would be but a storm in the 
professors teacups” if the HBP hadn't come along and raised the stakes 
enormously. It is all too easy to imagine other areas of neuroscience 
research being starved for resources by the HBP — especially in Switzer- 
land, which as host country will have to provide a substantial, but still- 
undetermined, fraction of the funding. Douglas asks: should Europe 
be spending €1 billion to support the passionate quest of one man? He 
concedes that visionaries are sometimes necessary to drive progress. 
“But what if they're passionately wrong?” 

Also fuelling anxiety — and irritation — is the widespread sense that 
Markram has been making his case through the news media, not through 
publishing, conferences and the other conventional channels of science. 
Reporters see much to like: Markram is tall, striking and explains his 
ideas with the clarity, quotability and urgency of a South African version 
of the late Carl Sagan. He has “a hypnotic effect’; says Richard Hahnloser, 
a computational neuroscientist at the INI. But critics say that this results 
in too many news accounts that leave the impression that the HBP will, 
say, eliminate the need for experimental animals. 

“The whole neuroscience community will be in trouble ten years from 
now” when the implied predictions don’t come true, says another INI 
researcher, who worries that the politicians will be right there saying, 
“But you promised!” 


MARCH OF PROGRESS 

In Bern, Markram bristled at accusations that he has deliberately 
cultivated hype. “I have never said that the HBP would replace animal 
experiments,” he shot back at one questioner. “I said that simulation 
helps you choose the experiments that will best add value.” 

Markram was also at pains to insist that the HBP will be open to other 
modelling approaches. “This concern is unfounded because they simply 
have not bothered to find out what is being proposed,’ he told Nature 
after the meeting. The final facility “will allow anyone to build models 
at a range of levels of biological detail with as much data as possible 
from anywhere”. 

Markram seems to be building support. Last year, the board that over- 
sees both the ETH and the EPFL enthusiastically endorsed the Blue Brain 
Project after a rigorous review by a four-member panel that included two 
outspoken sceptics of Markram’s approach. The board asked the Swiss 
parliament to commit 75 million Swiss francs (US$81 million) to the 
project for 2013-16 — more than ten times Blue Brain’s current budget. 
Parliament's decision is expected next month. 

Markram is optimistic that the European Union will come to much the 
same conclusion about the HBP. However, if the project isn't endorsed, 
says Markram, “we'll just continue with Blue Brain’ — although it may 
take a lot longer to reach a full brain simulation. 

Markram clearly feels that history is on his side. “Simulation-based 
research is an inevitability,’ he declared in Bern. “If I get stopped from 
doing this, it’s going to happen. It has happened already in many areas 
of science. And it is going to happen in life science.” m 


M. Mitchell Waldrop is a features editor for Nature based in 
Washington DC. 
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Using techniques developed by Alan Turing, the Colossus was able to decode German wartime communications. 


The dawn of computing 


Alan Turing’s bridging of logic and machines laid the foundation for 
digital computers, says George Dyson. 


he history of digital computing can 
TT divided into an old testament and 
anew testament. The prophets of the 
old testament, led by Gottfried Wilhelm 
Leibniz in the 1670s, supplied the logic; 
those of the new testament, led by John von 
Neumann in the 1940s, built the machines. 
Alan Turing, born on 23 June 1912, falls in 
between. His paper ‘On computable num- 
bers, with an application to the Entschei- 
dungsproblem,, written in 1936 while he was 
a fellow at the University of Cambridge’s 
King’s College, UK, and published shortly 
after his arrival as a graduate student at 
Princeton University, New Jersey, in October 
1936, led the way to the implementation of 
mathematical logic in machines’. 
Turing was aiming to solve German 
mathematician David Hilbert’s 1928 


Entscheidungsproblem — the ‘decision 
problem’ of whether a mechanical proce- 
dure could determine the validity of any 
logical statement in a finite number of steps. 
Turing took the 1930s concept of a computer 
— a person equipped with pencil, paper 
and instructions — and deconstructed it by 
removing all traces of intelligence except for 
the ability to follow instructions and read 
and write a finite alphabet of symbols on an 
unbounded paper tape. 

The result was the Turing machine: a 
mathematical black box that obeys preset 
instructions, represented by symbols encoded 
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on the tape or stored in the machine’ internal 
‘state of mind. At any moment, the machine 
can read, write or erase a symbol from a 
square; move a square to the right or left; or 
change its state of mind. Complex symbols 
can be represented by strings of simpler ones, 
the limit being the binary distinction between 
two symbols (or the presence or absence ofa 
hole in the tape). These ‘bits’ of information 
can take two forms: patterns in space that are 
transmitted across time, termed memory; or 
patterns in time that are transmitted across 
space, called code. For a Turing machine, time 
exists not as a continuum, but as a sequence 
of changes of state. 

Turing then demonstrated the existence 
ofa single machine that could “compute any 
computable” sequence’. Such a ‘universal 
computing machine’ could mimic any > 
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> other machine by executing an encoded 
description of it. Thus, he foresaw the 
concept of software. 

Finally, Turing answered Hilbert’s 
conundrum. He identified a question that 
could not be answered by any machine ina 
finite number of steps: will a given encoded 
description come to a halt or run forever 
when executed by the universal computing 
machine? The answer to the Entscheidung- 
sproblem was, therefore, no. 

“You can build an organ which can do 
anything that can be done,’ explained von 
Neumann, paraphrasing Turing, in a lec- 
ture in 1949, “but you cannot build an organ 
which tells you whether it can be done”’. 
Sensing the limits of deterministic machines, 
Turing began to explore non-deterministic 
computation by ‘oracle’ machines. These 
proceed step-by-step, but occasionally make 
unpredictable leaps by consulting “a kind of 
oracle as it were”’. 


CODE BREAKING 

Having completed his PhD, Turing returned 
to England in July 1938. The outbreak of the 
Second World War soon sparked demand for 
his ideas, and he was sequestered at the Gov- 
ernment Code & Cypher School at Bletch- 
ley Park. There, Turing and his colleagues, 
including his mentor, topologist Maxwell 
‘Max’ Newman, deciphered enemy commu- 
nications, including messages encrypted by 
the German Enigma machine — a Turing 
machine with an internal mechanism that 
shifted through 10” possible configurations 
to scramble the input text. 

Starting with a set of electromechanical 
devices called bombes, each of which could 
emulate 36 suspected Enigma configurations 
at a time, the researchers at Bletchley Park, 
assisted by engineer Thomas Flowers at the 
General Post Office Research Station in Dol- 
lis Hill, London, developed a machine called 
Colossus — a sophisticated electronic digi- 
tal computer. A 1,500-vacuum-tube internal 
memory provided Colossus with a program- 
mable state of mind that searched for clues 
in coded sequences scanned from punched 
paper tape. 

Colossus was swiftly improved and 
duplicated, producing a second generation 
of 2,400-tube machines that influenced the 
outcome of the war and the development of 
modern computers, although Britain's Offi- 
cial Secrets Act kept the details embargoed 
for more than 30 years. When the war ended, 
the push for more powerful computers shifted 
from cryptanalysis to the design of nuclear 
weapons, and the United States, which had 
declassified its wartime computer, the ENIAC 
(Electronic Numerical Integrator and Com- 
puter), in February 1946, assumed the lead. 

At the Institute for Advanced Study in 
Princeton, and with funding from the US 
Army, the Office of Naval Research and 


the US Atomic Energy Commission, von 
Neumann set out to build an electronic 
version of Turing’s universal computing 
machine. He wanted a Turing machine with 
a memory that was accessible at the speed 
of light, and he decided to build it him- 
self. The US government wanted to know 
whether a hydrogen bomb was feasible, so 
von Neumann promised it a machine, with 
5 kilobytes of storage, that could run the 
required hydrodynamic codes. The com- 
puter’s design was made public so that copies 
could be freely duplicated — and commer- 
cialized by IBM. “Words coding the orders are 
handled in the memory just like numbers,” 
von Neumann announced at the first project 
meeting, on 12 November 1945 (ref. 4). This 
mingling of data and instructions was central 
to Turing’s model. Most computers today are 
the direct offspring, in terms of their logical 
architecture, of a Turing machine built from 
war-surplus components in an outbuilding on 
a former New Jersey farm. 

Turing and von Neumann first met in 
Cambridge in 1935, and subsequently spent 
two years together in Princeton, where 
Newman joined them for 6 months. How 

much Turing and 
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built from Turing was in 


the United States 
between November 
1942 and March 
1943, and that von 
Neumann was in England between February 
and July 1943. During the war, British phys- 
icists, in consultation with von Neumann, 
made important contributions to the atomic 
bomb project at Los Alamos in New Mexico; 
and US cryptanalysts, in consultation with 
Turing, contributed to the effort at Bletchley 
Park. Although they could not communicate 
them openly in writing, Turing, von Neu- 
mann and Newman probably shared their 
ideas verbally, both during and after the war. 
Turing’s model was one-dimensional: a 
string of symbols encoded on a tape. von 
Neumann's implementation was two-dimen- 
sional: the random-access address matrix 
that underlies most computers today. The 
Internet — many Turing machines with con- 
current access to a shared tape — has made 
the landscape three-dimensional. Yet the 
way in which computers work has remained 
fundamentally unchanged since 1946. 


war-surplus 
components.” 


LEARNING FROM MISTAKES 

Both Turing and von Neumann were con- 
scious of processing errors in their machines. 
Early codes could be fully debugged, but the 
hardware was more unreliable, giving incon- 
sistent results — a problem that has since 
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been reversed. Both men knew that biology 
relied on statistical, fault-tolerant methods 
for processing information (such as pulse- 
frequency coding in the brain) and assumed 
that technology would follow nature’s lead. If 
“every error has to be caught, explained and 
corrected, a system of the complexity of the 
living organism would not run fora millisec- 
ond” von Neumann commented’. 

“If a machine is expected to be infallible, 
it cannot also be intelligent,” Turing noted 
in 1947 (ref. 6). When Turing joined New- 
man's group at the University of Manchester, 
UK, the following year and began designing 
the Manchester Mark 1 (a prototype for the 
Ferranti Mark 1, the first commercial stored- 
program electronic digital computer), he 
included a random-number generator, 
which allowed the computer to make guesses 
and learn from its mistakes. 

Turing’s deterministic universal machine 
receives the most attention, but his non-deter- 
ministic oracle machines are closer to the way 
in which intelligence really works: intuition 
bridging the gaps between logical sequences. 
Turing’s oracle machines are no longer theo- 
retical abstractions — an Internet search 
engine, for instance, operates deterministi- 
cally until a person clicks on a link, adding 
non-deterministically to the search engines 
map of where the meaningful information is. 

Turing wanted to know how molecules 
were able to collectively self-organize, and 
whether machines could think. Von Neu- 
mann wanted to know how the brain worked 
and whether machines could reproduce. 
Turing, who died at the age of 41, left behind 
an unfinished theory of morphogenesis, and 
von Neumann, who died aged 53, left an 
unfinished theory of self-reproduction — a 
model inspired by the Turing machine's abil- 
ity to generate copies of itself. 

Had Turing and von Neumann lived 
longer, we can only imagine how their ideas 
might have merged. Their lives were both cut 
short just as the mechanism underlying the 
translation between sequence and structure 
in biology was revealed. = 


George Dyson is a writer based in 
Bellingham, Washington, and author of 
Turing’s Cathedral. 

e-mail: Gdyson@ias.edu 
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Life’s code script 


Turing machines and cells have much in common, argues Sydney Brenner. 


iological research is in crisis, and in 
B Alan Turing’s work there is much to 

guide us. Technology gives us the 
tools to analyse organisms at all scales, but 
we are drowning in a sea of data and thirst- 
ing for some theoretical framework with 
which to understand it. Although many 
believe that ‘more is better; history tells us 
that ‘least is best. We need theory and a firm 
grasp on the nature of the objects we study 
to predict the rest. 

Three of Turing’s papers are relevant to 
biology. In 1952, “The chemical basis of mor- 
phogenesis’' explored the hypothesis that 
patterns are generated in plants and animals 
by “chemical substances called morpho- 
gens, reacting together and diffusing 
through a tissue”. Using differential 
equations, Turing set out how instabil- 
ities in a homogeneous medium could 
produce wave patterns that might account 


for processes such as the segregation of 


tissue types in the developing embryo. 

Yet biological support for Turing’s 
idea has been marginal. The pre- 
ordered patterns found in Drosophila 
development do not fit the instability 
theory, which, until recently, could 
describe only chemical systems. Skin 
patterning has, however, been shown 
to follow a broader interpretation of 
Turing’s terms’, where cell-to-cell sig- 
nalling pathways, rather than individual 
molecules, are considered. The ion channels 
postulated by Alan Lloyd Hodgkin and 
Andrew Huxley’, also in 1952, were dis- 
covered more immediately by molecular 
biology. 

Turing published another biology-related 
paper, in 1950. ‘Computing machinery and 
intelligence” introduced the Turing test as 
an imitation game in which an outside inter- 
rogator tries to distinguish between a com- 
puting machine and a human foil through 
their responses to questions. But the Turing 
test does not say whether machines that 
match humans have intelligence, nor does 
it simulate the brain. For that, we need a 
theory for how the brain works. 

The most interesting connection with 
biology, in my view, is in Turing’s most impor- 
tant paper: ‘On computable numbers with an 
application to the Entscheidungsproblem’’, 
published in 1936, when Turing was just 24. 

Computable numbers are defined as 
those whose decimals are calculable by finite 
means. Turing introduced what became 
known as the Turing machine to formalize 


the computation. The abstract machine 
is provided with a tape, which it scans one 
square at a time, and it can write, erase or 
omit symbols. The scanner may alter its 
mechanical state, and it can ‘remember’ pre- 
viously read symbols. Essentially, the system 
is a set of instructions written on the tape, 
which describes the machine. Turing also 
defined a universal Turing machine, which 
can carry out any computation for which an 
instruction set can be written — this is the 


origin of the digital computer. 

Turing’s ideas were carried further in the 
1940s by mathematician and engineer John 
von Neumann, who conceived of a ‘con- 
structor’ machine capable of assembling 
another according to a description. A uni- 
versal constructor with its own description 
would build a machine like itself. To 
complete the task, the universal construc- 
tor needs to copy its description and insert 
the copy into the offspring machine. Von 
Neumann noted that if the copying machine 
made errors, these ‘mutations’ would 
provide inheritable changes in the progeny. 

Arguably the best examples of Turing’s and 
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von Neumann's machines are to be found in 
biology. Nowhere else are there such com- 
plicated systems, in which every organism 
contains an internal description of itself. The 
concept of the gene as a symbolic represen- 
tation of the organism — a code script — isa 
fundamental feature of the living world and 
must form the kernel of biological theory. 
Turing died in 1954, one year after the 
discovery of the double-helical structure of 
DNA by James Watson and Francis Crick, 
but before biology’s subsequent revolution. 
Neither he nor von Neumann had any direct 
effect on molecular biology, but their work 
allows us to discipline our thoughts about 
machines, both natural and artificial. 
Turing invented the stored-program 
computer, and von Neumann showed that 
the description is separate from the uni- 
versal constructor. This is not trivial. 
Physicist Erwin Schrédinger confused 
the program and the constructor in 
his 1944 book What is Life?, in which 
he saw chromosomes as “architect’s 
plan and builder’s craft in one”. This 
is wrong. The code script contains 
only a description of the executive 
function, not the function itself. 
Thus, Hodgkin and Huxley’s 
equations represent properties of the 
nerve impulse as an electrical circuit, 
but the required channels and pumps are 
constructed from specifications in the genes. 
Our problems reside in understanding the 
constructor part of the machinery, and here 
the cell is the right level of abstraction’. 
Biologists ask only three questions of a 
living organism: how does it work? How is 
it built? And how did it get that way? They 
are problems embodied in the classical fields 
of physiology, embryology and evolution. 
And at the core of everything are the tapes 
containing the descriptions to build these 
special Turing machines. m 


Sydney Brenner is a senior fellow at the 
Janelia Farm Research Campus, Howard 
Hughes Medical Institute, Ashburn, 
Virginia, 20147, USA. 
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Is the brain a good model 
for machine intelligence? 


To celebrate the centenary of the year of Alan Turing’s 
birth, four scientists and entrepreneurs assess the 
divide between neuroscience and computing. 


RODNEY BROOKS 
Avoid the cerebral 
blind alley 


Emeritus professor of robotics, 
Massachusetts Institute of Technology 


I believe that we are in an intellectual cul-de- 
sac, in which we model brains and computers 
on each other, and so prevent ourselves from 
having deep insights that would come with 
new models. 

The first step in this back and forth was 
made by Alan Turing. In his 1936 paper’ 
laying the foundations of computation, 
Turing used a person as the basis for his 


model. He abstracted the actions ofa human 
‘computer’ using paper and pencil to per- 
form a calculation (as the word meant then) 
into a formalized machine, manipulating 
symbols on an infinite paper tape. 

But there is a worry that his version of 
computation, based on functions of inte- 
gers, is limited. Biological systems clearly 
differ. They must respond to varied stimuli 
over long periods of time; those responses 
in turn alter their environment and subse- 
quent stimuli. The individual behaviours of 
social insects, for example, are affected by 
the structure of the home they build and the 
behaviour of their siblings within it. 

Nevertheless, for 70 years, those people 
working in what is now called computa- 
tional neuroscience have assumed that the 
brain is a computer — a machine that is 
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equivalent to Turing’s finite-state machine 
with an infinite tape and a finite symbol set, 
and that does computation. 

In 1943, Warren McCulloch and Walter 
Pitts’ noted the “all-or-none” nature of the 
firing of neurons in a nervous system, and 
suggested that networks of neurons could 
be modelled as logical propositions. They 
modelled a network of neurons as circuits of 
logic gates, noting that these may “compute 
only such numbers as cana Turing machine’. 
But more, they proposed that everything at 
a psychological level happens in these net- 
works. Over the decades, such ideas begat 
more studies in neural networks, which in 
turn begat computational neuroscience. 
Now those metaphors and models pervade 
explanations of how the brain ‘computes. But 
these binary abstractions do not capture all 
the complexities inherent in the brain. 

So now I see circles before my eyes. The 
brain has become a digital computer; yet we 
are still trying to make our machines intelli- 
gent. Should those machines be modelled on 
the brain, given that our models of the brain 
are performed on such machines? That will 
probably not be enough. 

When you are stuck, you are stuck. We 
will get out of this cul-de-sac, but it will take 
some brave and bright souls to break out of 
our circular confusions of models. 


DEMIS HASSABIS 
Model the brain’s 
algorithms 


Neuroscientist, computer -game 
producer and chess master, 
University College London 


Alan Turing looked to the human brain as 
the prototype for intelligence. If he were alive 
today, he would surely be working at the inter- 
section of natural and artificial intelligence. 

Yet to date, artificial intelligence (AI) 
researchers have mostly ignored the brain 
as a source of algorithmic ideas. Although 
in Turing’s time we lacked the means to look 
inside this biological “black box, we now 
have a host of tools, from functional mag- 
netic resonance imaging to optogenetics, 
with which to do so. 

Neuroscience has two key contributions 
to make towards progress in AI. First, the 
many structures being discovered in the 
brain — such as grid cells used for naviga- 
tion, or hierarchical cell layers for vision 
processing — may inspire new computer 
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algorithms and architectures. Second, 
neuroscience findings may validate the plau- 
sibility of existing algorithms being integral 
parts ofa general AI system. 

To advance AI, we need to better under- 
stand the brain’s workings at the algorithmic 
level — the representations and processes 
that the brain uses to portray the world 
around us. For example, if we knew how 
conceptual knowledge was formed from per- 
ceptual inputs, it would crucially allow for the 
meaning of symbols in an artificial language 
system to be grounded in sensory ‘reality’ 

Al researchers should not only immerse 
themselves in the latest brain research, but 
also conduct neuroscience experiments to 
address key questions such as: “How is con- 
ceptual knowledge acquired?” Conversely, 
from a neuroscience perspective, attempt- 
ing to distil intelligence into an algorithmic 
construct may prove to be the best path to 
understanding some of the enduring mys- 
teries of our minds, such as consciousness 
and dreams. 


DENNIS BRAY 
Brain emulation 
requires cells 


Department of Physiology, 
Development and Neuroscience, 
University of Cambridge 


Machines can match us in many tasks, but 
they work differently from networks of nerve 
cells. If our aim is to build machines that are 
ever more intelligent and dexterous, then we 
should use circuits of copper and silicon. But 
if our aim is to reproduce the human brain, 
with its quirky brilliance, capacity for multi- 
tasking and sense of self, we have to look for 
other materials and different designs. 
Computers outperform us in complex 
mathematical calculations and are better at 
storing and retrieving data. We accept that 
they can beat us at chess — once regarded 
as the apogee of human intellect. But the 
success of a computer called Watson in US 
television quiz show Jeopardy! in 2011 was 
a nail in the coffin of human superiority. 
The machine beat two human contestants 
by answering questions posed in colloquial 
English, making sense of cultural allusions, 
metaphors, puns and jokes. If Alan Turing 
had been given a transcript of the show, 
would he have spotted the odd one out? 
Watson may be the latest vindication of 
Turing’s view of intellectual processes as a 
series of logical states. But its internal work- 
ings are not based on the human brain. Broad 
similarities in organization might be imposed 
by the nature of the task, but most software 
engineers neither know nor care about 


anatomy or physiology. Even biologically 
inspired approaches such as cellular autom- 
ata, genetic algorithms and neural networks 
have only a tenuous link to living tissue. 

In 1944, Turing confessed his dream of 
building a brain, and many people continue 
in that endeavour to this day. Yet any neuro- 
biologist will view such attempts as naive. 
How can you represent a neuronal synapse — 
a complex structure containing hundreds of 
different proteins, each a chemical prodigy in 
its own right and arranged in a mare’s nest of 
interactions — with a single line of code? We 
still do not know the detailed circuitry of any 
region of the brain well enough to reproduce 
its structure. Brains are special. They steer us 
through the world, tell us what to do or say, 
and perform myriad vital functions. Brains 
are the source of our emotions, motivation, 
creativity and consciousness. Because no one 
knows how to reproduce any of these features 
in an artificial machine, we must consider 
that something important is missing from 
the canonical microchip. 

Brains differ from computers in anumber 
of key respects. They operate in cycles rather 
than in linear chains of causality, sending 
and receiving signals back and forth. Unlike 
the hardware and software of a machine, the 
mind and brain are not distinct entities. And 
then there is the question of chemistry. 

Living cells process incoming sensory 
information and generate not just electri- 
cal signals but subtle biochemical changes. 
Cells are soft, malleable and built from an 
essentially infinite variety of macromolecular 
species quite unlike silicon chips. Organisms 
encode past experiences in distinct cellular 
states — in humans these are the substrate 
of goal-oriented movements and the sense 
of self. Perhaps machines built from cell-like 
components would be more like us. 


AMNON SHASHUA 
Speed will trump 
brain’s advantages 


Sachs Professor of Computer Science, 
Hebrew University of Jerusalem, and 
co-founder and chairman of Mobileye 


The saying that “people who are really 
serious about software should make their 
own hardware’, attributed to computer 
scientist Alan Kay in the 1980s, still rings 
true today. The idea that the function and 
form of computing architecture should serve 
each other is at the root of algorithms in 
signal processing, image rendering, gaming, 
video compression and streaming. I believe 
that it is also true for the human brain — 
meaning that the brain does not implement 
‘intelligence’ in the same way as a computer. 


Two of the many fundamental differences 
between the brain and the computer are 
memory and processing speed. The analogue 
of long-term memory in a computer is 
the hard disk, which can store practically 
unlimited amounts of data. Short-term infor- 
mation is held in its random access memory 
(RAM), the capacity of which is astronomical 
compared with the human brain. Such quan- 
titative differences become qualitative when 

considering strategies 


“Signals in for intelligence. 

the brain are Intelligence is mani- 
transmitted — fested by the ability to 
at asnail’s learn. Machine-learning 
pace.” practitioners use ‘stat- 


istical learning’ which 
requires a very large collection of examples 
on which to generalize. This ‘frequentist’ 
approach to probabilistic reasoning needs 
vast memory capacity and algorithms that are 
at odds with available data on how the brain 
works. For example, IBM computer Watson 
needed to consume terabytes of reference 
material to beat human contestants on Jeop- 
ardy!. Volvos pedestrian-detection system 
(developed by Mobileye) learned to identify 
people by using millions of pictures. In both 
cases, the human brain is considerably more 
parsimonious in the reliance on data — some- 
thing that does not constrain the computer. 

In terms of processing power, the brain 
can reach about 10-50 petaflops — equiva- 
lent to hundreds of thousands of the most 
advanced Intel Core i7 CPUs. Yet signals 
in the brain are transmitted at a snail’s pace 
— five or six orders of magnitude slower 
than modern CPUs. This huge difference in 
communication speed drives vastly different 
architectures. 

The brain compensates for the slow 
signal speed by adopting a hierarchical paral- 
lel structure, involving successive layers with 
increasing receptive field and complexity. By 
comparison, a computer architecture is usu- 
ally flat and, because of its much faster clock 
rate, can employ brute-force techniques. 
Computer chess systems such as Deep Blue 
use pattern-recognition strategies, such as 
libraries of opening moves and completely 
solved end-games, complemented by their 
ability to evaluate the outcomes of some 
200 million moves per second. This is way 
beyond the best grandmaster. 

An intimate understanding of how cogni- 
tive tasks are performed at an algorithmic 
level would allow artificial intelligence to 
grow in leaps and bounds. But we must bear 
in mind that the vastly different architec- 
ture of the computer favours strategies that 
make optimal use of its practically unlimited 
memory capacity and brute-force search. m 
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Pattern formation 


We are only beginning to see the impact of Turing’s 
influential work on morphogenesis, says John Reinitz. 


lan Turing’s 1952 paper on the origin 
A: biological patterning’ solved an 
intellectual problem that had seemed 
so hopeless that it caused a great develop- 
mental biologist, Hans Driesch, to give up 
science and turn to the philosophy of vitalism. 

In the late nineteenth century, Driesch, 
and later Hans Spemann, demonstrated that 
animal bodies develop from a patternless 
single cell, rather than growing from a micro- 
scopic, preformed version of the adult body 
— in humans, the ‘homunculus’ But such 
self-organization, Driesch realized, could not 
be understood with the ideas of that century. 
Before the invention of computers, applied 
mathematics dealt only with linear differen- 
tial equations, which can amplify a pattern 
but not generate it. 

In ‘The chemical basis of morphogenesis, 
Turing showed that a pattern can indeed form 
de novo. In considering how an embryos 
development unfolds instant by instant from 
its molecular and mechanical state, Turing 
was using a modern approach. Developmen- 
tal biologists today similarly investigate how 
molecular determinants and forces exerted by 
cells control embryonic patterning. 

Turing’s focus was on chemical patterns: he 
coined the term ‘morphogen as an abstrac- 
tion for a molecule capable of inducing tissue 
differentiation later on. This concept will be 
familiar to any molecular biologist: the pro- 
tein products of the HOX gene cluster, for 
example, which are essential for body pat- 
terning throughout the animal kingdom, are 


morphogens in Turing’s sense. (Confusingly, 
the term has been more narrowly defined 
since.) 

At the heart of pattern-making is sym- 
metry-breaking. Turing considered an 
idealized embryo beginning with a uniform 
concentration of morphogens, which have 
translational symmetry that is lost as specific 
tissues emerge. He raised deep questions that 
are still unsolved, noting for instance that all 
physical laws known at the time had mirror- 
image symmetry, but biological systems did 
not. Turing speculated that the asymmetry of 
organisms originated from that of biological 
molecules. His point is still relevant to life’s 
origins. 

Turing’s argument involved a mathemati- 
cal trick: he created a nonlinear system by 
turning on diffusion discontinuously in an 
otherwise linear system at a specific instant. 
Without diffusion, the system is stable and 
homogeneous, but with diffusion, it becomes 
unstable and forms spatial pattern. The bril- 
liance of the trick is that the nonlinearity is 
confined to a single point in time, so that at 
all other times, only the theory of linear equa- 
tions is needed. Turing cleverly arranged to 
have diffusion generate pattern, rather than 
blur it, as it usually does. 

The influence of Turing’s paper is difficult 
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to overstate. It was a transition point from 


the era of analytical mathematics to that of 


computational mathematics. Although his 
proof was constructed analytically, Turing’s 
paper contains the first computer simula- 
tions of pattern formation in the presence 
of stochastic fluctuations, and is possibly the 
first openly published case of computational 
experimentation. 

Turing used analytical arguments of the 
nineteenth century to point the way towards 
the computational science of the twenty-first 
century. He was well aware, however, that 
nonlinear science and developmental biology 
would require more advanced computational 
methods. “Most of the organism, most of the 
time, is developing from one pattern into 
another, rather than from homogeneity into 
a pattern,” he stated'. He realized that even 
though an embracing theory for such pro- 
cesses might not be possible, individual cases 
could be modelled with a digital computer. 

Yet Turing’s work is frequently misinter- 
preted, perhaps because he died tragically in 
1954, before he could correct the record. His 
analytical arguments are often mistaken for 
biological predictions, although Turing did 
not intend them as such. His hypothetical 
system, based on two substances, was a sim- 
plification. For the pattern-forming trick to 
work, one substance should catalyse synthesis 
of both substances while diffusing slowly; the 
other should catalyse destruction of both sub- 
stances while diffusing rapidly. For patterns 
that shift over time, three substances would 
be required. A field of investigation of these 
models has sprung up’, but credit or blame for 
the results rests with those authors, not Turing. 

What Turing should receive credit for is 
opening the door to a new view of develop- 
mental biology, in which we deal directly 
with the chemical reactions and mechani- 
cal forces embryos use to self-organize their 
bodies from a single cell. He was well ahead 
of his time. It was three decades before the 
work on Drosophila embryos by Lewis’, 
Wieschaus and Niisslein-Volhard’ led to the 
discovery of real morphogens. It is the young 
researchers of today who will benefit most 
from reading Turing'’s work — seeing his ideas 
about morphogenesis not as speculation but 
as the conceptual framework for concrete 
problems. = 


John Reinitz is in the departments of 
statistics, ecology and evolution, and 
molecular genetics and cell biology at the 
University of Chicago, Chicago, Illinois 
60637, USA. 
reinitz@galton.uchicago.edu 
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The incomputable reality 


The natural world’s interconnectivity should inspire better 
models of the Universe, says Barry Cooper. 


lan Turing put bounds on what is 
Aensstic ina famous 1936 paper’. 

The Turing machines he presented 
implement finite algorithms, handling data 
coded as real numbers. They are determin- 
istic, but give some bizarre results. You can 
build a universal machine that can simulate 
any other Turing machine. But not every 
question you can ask of it has a computable 
answer: you cannot predict, for example, 
whether it ever spits out a given number or 
series of numbers. 

By coincidence, our Newtonian view of 
physics faltered at about the same time as 
our computable view of mathematics. Lin- 
gering problems in classical physics, such 
as the unpredictable trajectories of three 
bodies following a collision, may involve 
incomputability. Albert Einstein’s theory 
of general relativity opens up a new world 
of computation with exotic objects such as 
spinning black holes. Quantum mechanics 
tells us that measurements are inherently 
uncertain. 

The concept of computability 
is basic to modern science, from 
quantum gravity to artificial intel- 
ligence. It is also relevant in the 
everyday world, where it is useful to 
distinguish problems that are merely 
difficult to compute in practice from 
those that are intrinsically impossi- 
ble with any machine. Incomputabil- 
ity should trouble economists, because 
breakdowns of control in chaotic mar- 
kets can wreak havoc. 

But disciplinary boundaries are 
preventing us from getting a full view 
of its role. Cosmetic differences may 
hide revealing parallels. 


EMERGENT PHENOMENA 

Turing was interested in the mathematics 
of computing and also in its embodiment 
— the material environment that houses 
it. This theme links all of his work, from 
machines to the brain and morphogen- 
esis. Although many mathematicians and 
software engineers today see it as irrel- 
evant, embodiment is key to explaining 
the physical world. 

Take turbulence: a river swollen by recent 
rain occasionally erupts into surprising for- 
mations that we would not expect from the 
basic dynamics of the water flow. The rea- 
son is coherence — non-local connectivity 
affects the water’s motion. Turbulence, and 
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other ‘emergent’ nonlinear phenomena, may 
not be computable with a Turing machine. 
Zebra stripes and tropical-fish patterns, 
which Turing described in 1952-54 with his 
differential equations for morphogenesis, 
arise similarly. 

Even in nonlinear systems, such high- 
order behaviour is causal — one phenom- 
enon triggers another. Levels of explanation, 
from the quantum to the macroscopic, can 
be applied. But modelling the evolution of 
the higher-order effects is difficult in any- 
thing other than a broad-brush way. Such 
problems infiltrate all our models of the 
natural world. 


The Universe is like that turbulent stream 


— its behaviour as a whole guided by myriad 
connections at various scales. It has many 
emergent levels of causality, bridged by 
phase transitions. The mechanistic struc- 
ture that science deals with so well, and its 
invariant laws, are hard to explain in terms 
of the quantum level. Biology emerges from 
the quantum world, but is not computable 
from it. We are part of an organic whole — 
fragmented but coherent. 

Across these boundaries, higher-level 
relations can feed back into lower ones. But 
looking up from a lower level, the causality 
will not be computable. For example, the 
uncertainty principle prevents the quan- 
tum world from fully describing the state 


of a particle at any instant. A measurement 
produces a full description, but we cannot 
compute how it does it. In Turing’s world, a 
description of reality is not always enough 
for a computable prediction. 

Nature presents us with new ways of 
computing, from the Universe to the brain. 
Turing went on to build logical hierarchies to 
better understand real-world computation, 
which includes intuitive or unpredictable 
leaps’. Researchers experimenting with intel- 
ligent machines today see the possibilities in 
such an approach. But problems of control 
of higher-order behaviours still present 
formidable challenges to implementing it. 


BRIDGE BUILDING 

It took nature millions of years to build a 
human brain. Meanwhile, we have to live 
with the stupidity of purely algorithmic 
processes. We need to embrace more experi- 
mental approaches to computation, and a 
renewed respect for embodied computing 
— as anticipated in Turing’s late work in the 
1950s on artificial intelligence and morpho- 
genesis. 

Bridges between mathematicians and 

physicists are important if we are to do this. 
Itis along time since Kurt Godel and Albert 
Einstein chatted in the halls of Princeton 
University in New Jersey. Mathemati- 
cians can bring to the table Turing’s model 
of basic causal structure. This would help 
physicists to discover more complete descrip- 
tions of the Universe — making redundant 
Hugh Everett’s many-worlds interpretation 
and related multiverse hypotheses — and 
fix the arbitrariness of parts of the standard 
model of particle physics. 

Samson Abramsky, a computer scientist at 
the University of Oxford, UK, recently asked: 
“Why do we compute?” Turing computation 
does not create anything that is not there 
already in the initial data. Can information 
increase in computation? 

If we look at the world with new eyes, 
allowing computation full expression, we 
may come to startling conclusions. = 


Barry Cooper is in the School of 
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Leeds LS2 9JT, UK. 
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Our brains recreate the emotions of actors such as Geraldine James when we watch them perform. 


Powerful acts 


Giovanni Frazzetto explores how theatre exerts its 
psychological effects on the emotions. 


rom rage and grief to exquisite 
f tendresse, emotion is laid bare in 

theatre. Few art forms electrify or 
illuminate as powerfully as stage acting. 
But how have theatrical greats such as John 
Gielgud or Vanessa Redgrave cast their spell? 
Acting may be one of the most ancient arts, 
but science is only just beginning to get to 
grips with it. 

Science started to seep into theatre in the 
late nineteenth and early twentieth centuries, 
with the Russian actor and theatre direc- 
tor Constantin Stanislavski. Founder of the 
influential Moscow Art Theatre, Stanislavski 


turned to physiologist 

Ivan Pavlov’s research Mi 
onconditioned reflexes  Areview of Danny 
to improve his acting _ Boyle's staging of 
method. The aim was _ Frankenstein: 


to create performance 


that united psychological experience and 
physical action. 

Stanislavski sought a way to consciously 
trigger an actor’s emotional expression. 
Science had begun to discover that neu- 
ral pathways underlie complex behaviour 
and emotions, which can be conditioned 
in response to a changing environment. By 
practising key physical actions pertinent to 
the character and the play, Stanislavski real- 
ized, the actor could learn, by reflex, how to 
express the psychological experience of the 
emotion — with help from the imagination. 
A particular posture or movement would 
trigger a particular emotion. So by working 
hard on small actions such as clenching the 
fists and tensing the neck muscles, the actor 
could trigger anger, or they could awaken 
feelings of despair by shuffling, drooping and 
bowing the shoulders. 
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But, as we now know, the psychology of 
performance is more complex than this. 
To deliver a believable performance, actors 
need to remember not just emotions, body 
postures and expressions, but also their cues 
and lines. And, more importantly, they must 
seek ways of engaging with their audience to 
evoke empathy — the recognition or sharing 
of emotional states. 

German philosopher Theodor Lipps 
was the first to use the term empathy (Ein- 
fiihlung, literally ‘feeling into’) in the early 
twentieth century as a way of describing the 
relationship between artwork and observer 
in the psychology of aesthetic experience. In 
the 1990s, Italian neurophysiologists Gia- 
como Rizzolatti, Vittorio Gallese and their 
colleagues at the University of Parma in Italy 
offered a neurological framework for study- 
ing empathy through their discovery of mir- 
ror neurons — cells that fire both when we 
perform an action, and when we observe 
someone else performing it. They showed 
that our visual-motor system is activated as 
if we were executing an action that we are 
simply watching: the brain simulates that 
action. 

These discoveries have resonated widely 
among theatrical professionals. They extend 
to intentionality (thinking ofa way of doing 
some action), imitation (the replication of an 
action) and action understanding (grasping 
the import ofan action) — all central to act- 
ing technique. 

The actor must also demand something 
else of the audience: a suspension of dis- 
belief. The phrase was coined in 1817 by 
English poet and philosopher Samuel 
Taylor Coleridge to describe how weaving 
enough facts into a fantastical narrative 
will help readers to accept the story, rather 
than judge it as implausible. Film-makers 
suspend disbelief by exploiting the power 
of moving images 
in a darkened 
room, which lures 
an audience into 
their simulacrum 
of reality. In thea- 
tre, suspension of 
disbelief hinges 
on the switch 
between two realities — the set and the 
cast of actors, and the places and characters 
they represent. In the prologue to Henry V, 
Shakespeare asks the audience to transform 
the bare stage by seeing it as the world of 
the king at war with France: “Piece out our 
imperfections with your thoughts ... and 
make imaginary puissance.” 

Twentieth-century German playwright 
Bertolt Brecht deliberately turned this 
tactic on its head in his ‘epic theatre’ sys- 
tem. By using techniques such as having 
the actors suddenly sing out of character, 
he ensured that his audiences became 


emotionally detached from the characters. 
The audience members then became aware 
that they were witnessing fiction and were 
able to critically question the social reali- 
ties represented in the play. 

The cognitive processes underlying the 
suspension of disbelief have been the sub- 
ject of several scientific studies. In 2010, 
Marie-Noélle Metz-Lutz of the University 
of Strasbourg, France, and her colleagues 
used functional magnetic resonance imag- 
ing (fMRI) to scan the brains of people 
watching a play to pinpoint when they 
were transported into another reality. This 
was defined as when the subject’s brain 
response tallied with a passage in the script 
intended to elicit such a response. The 
brain regions that fired at those moments 
included two areas involved in process- 
ing language and, specifically, in under- 
standing metaphor, denoting the power of 
language to capture a spectator’s attention. 
Both regions are also involved in processes 
of social and aesthetic judgements, prob- 
ably governing appreciation of the writing 
style, plot or characterization. 

The French team also found that the 
subjects’ heart rates slowed during trans- 
portation, and that brain activity fell in areas 
involved in building consciousness about the 
self and the external world. Without activ- 
ity in these regions, an observer will take 
the fictionalized reality of the play at face 
value, despite the sensory perception of the 
stage, set and actors. Such results point to 
complete absorption in a play as a sort of 
hypnotic state involving the temporary loss 
of self-reference, and a disconnection from 
immediate sensory information — a distinct 
feeling of being ‘carried away. 

In theory, such scanning experiments 
might help playwrights to identify specific 
language and theatrical devices that trigger 
audiences to become as absorbed as possible, 
and so enrich acting as an art and theatre as a 
vehicle of meaning and ‘enchantment. With 
a nod to Stanislavski, playwrights could 
focus on what movements or expressions 
are the most poignant, and which are most 
effective at conveying grief, compassion or 
joy. Such studies could also reveal which 
metaphors express an action or thought with 
the most brevity and wit, and what elements 
of plot device or vocal emphasis can make a 
difference in the brain. 

Yet fMRI images and statistics will never 
replace the unpredictability and revelatory 
power of what is born in the rehearsal room. 
Acting is predicated on technique and craft, 
but remains visceral and intuitive. m 
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Books in brief 


DNA USA: A Genetic Portrait of America 

Bryan Sykes W. W. NoRTON 320 pp. £19.99 (2012) 

The US human population is a bouillabaisse of DNA. Geneticist 
Bryan Sykes took on the challenge of identifying its ingredients 

on an epic cross-country trip. He recounts the detective work — 
including interviews with genealogists and fellow geneticists — 
and methodology behind the findings. How did European genes 
appear in the DNA of Native Americans some 10,000 years ago, 
for instance? And why does the southwestern Hispanic population 
contain genes typically found in Jewish people? Ultimately, Sykes 
suggests, the country is an even richer human mix than we thought. 


The Emotional Life of Your Brain: How Its Unique Patterns Affect the 
Way You Think, Feel, and Live — And How You Can Change Them 
Richard J. Davidson and Sharon Begley HUDSON STREET PRESS 

279 pp. $25.95 (2012) 

Why do some people plod stoically through crises while others 
collapse? Science writer Sharon Begley and neuropsychologist 
Richard Davidson argue that each of us has an ‘emotional style’: a 
pattern of responses to life’s events that is allied to underlying brain 
systems. Looking at dimensions from social intuition to context 
sensitivity, the authors suggest that we can achieve better equilibrium 
by rewiring our emotional style through research-inspired exercises. 


Game Changer: Animal Rights and the Fate of Africa’s Wildlife 
Glen Martin UNIVERSITY OF CALIFORNIA PRESS 243 pp. £20.95 (2012) 
Africa’s wild megafauna are caught in the crossfire between animal- 
welfare campaigners and conservationists, argues environmental 
reporter Glen Martin. In this pacy, unsentimental account, Martin 
interviews seasoned conservation biologists, zoologists and game 
wardens, focusing on practice in Kenya, Namibia and Tanzania. He 
concludes that holistic strategies incorporating habitat conservation, 
controlled hunting and respect for local people’s needs are workable 
—and points out that measures such as ecotourism and protection 
for iconic species have backfired dramatically. 


The Undead: Organ Harvesting, the Ice-Water Test, Beating 
Heart Cadavers — How Medicine Is Blurring the Line Between 
Life and Death 

Dick Teresi PANTHEON 368 pp. $26.95 (2012) 

The moment of death, suggests science writer Dick Teresi, is harder 
to pin down than ever. He introduces us to those who work at this 
borderline: cell biologists, specialist doctors, undertakers and people 
who have recovered from comas. Charting historical definitions of 
death, the thinking of research greats and debates over near-death 
experiences, Teresi notes that the ethical challenges are immense, 
asking, for instance, whether all organ donors are unrevivable. 


The Forest Unseen: A Year’s Watch in Nature 

David George Haskell VIKING 288 pp. $25.95 (2012) 

Training a biologist’s eye on ecology, geology and climate, David 
Haskell visited a square metre of old-growth forest in southeastern 
Tennessee nearly every day for a year. His observations — of lichens, 
snowflakes, salamanders and more — are deftly interwoven with 

the science. His account is fascinating, whether he’s stripping 

off in January to experience the physiological effects of severe 

cold, describing the symphonic sounds of trees in a high wind, or 
wondering at the bacteriocidal properties of a vulture’s digestive tract. 
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INFECTIOUS DISEASE 


Chronicles of a 
killer virus 


Just over 30 years after HIV/AIDS was first recognized, 
three accounts of its ravages intrigue Robin Weiss. 


s a frightening pandemic associated 
A@= sex, blood and death, AIDS 

was bound to evoke a rich mythol- 
ogy. Nicoli Nattrass’s The AIDS Conspiracy 
deals with those myths and how scien- 
tific arguments counteract them. Jacques 
Pépin’s The Origins of AIDS looks back at 
the emergence of HIV in the era before the 
syndrome was recognized, and Victoria 
Harden’s AIDS at 30 covers the period after 
its identification in 1981. 

The AIDS Conspiracy is essential reading 
for anyone who is curious about why some 
people will not accept scientific facts about 
the nature, origin and lethality of HIV. As 
an HIV researcher, I used to divide people's 
strange beliefs about AIDS into myths of 
denial, and of blame and conspiracy. But 
Nattrass, who directs the AIDS and Society 
Research Unit at the Univer- 
sity of Cape Town in South 
Africa, explains how 
HIV denialism has also 
become a conspiratorial 
attack on science and 
medicine — one that 
aims to convince peo- 
ple that antiretroviral 
therapy is more harmful 
than the ‘blameless’ virus. 

Even when HIV is 
accepted as the cause of 
AIDS, Africa is blamed for its 
origin. Yet new diseases can 
arise anywhere: BSE or ‘mad 
cow disease’ in the United 
Kingdom, SARS in China 
and the 2009 HIN1 influ- 
enza pandemic in Mex- 
ico. Some AIDS creation 
myths continue to have 
an allure — for example, 
that HIV came out of oral 
polio vaccines, or that the virus 
is a man-made germ-warfare 
agent that was deliberately released in 
Africa by the United States. 

Nattrass identifies four types of HIV 
denialist: the dissident scientist who lends 
credibility; the ‘cultropreneur’ who ped- 
dles quack therapies; the living icon or 
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long-term survivor; and the praise-singer 
or journalist who sows doubt about HIV 
causing AIDS. The dissidents are a tiny 
group, yet their campaign against anti- 
retroviral therapy in South Africa has been 
estimated as leading to more than 300,000 
preventable AIDS-related deaths. Science 
can respond through mechanisms such as 
the Durban Declaration on the link between 
HIV and AIDS, which was signed by more 
than 5,000 scientists and physicians (Nature 
406, 15-16, 2000). Nattrass also points out 
that social-media activists have often 
been more effective at tackling HIV 
denialism than official bodies. 
In the superb The Origins of AIDS, 
Pépin — a Canadian epidemiologist 
who has worked across Africa — delves 
into the early phases of HIV emer- 
gence. After the ancestor of the 
pandemic HIV-1 group M 
passed from a chim- 
panzee to a human 
in southeast Cam- 
eroon about 100 
years ago, a few 
infected people 
travelled down the 
River Congo. AIDS 
became a commu- 
nity disease in Léopoldville (now Kinshasa), 
the capital of the Belgian Congo depicted in 
Joseph Conrad's 1903 novella Heart of Dark- 
ness. HIV began to thrive in its new host, 
Pépin shows, for several reasons. 

There was a surge in medical injections 
using non-sterile syringes in mid-twen- 
tieth-century Africa, giving the trans- 
mission of HIV (and hepatitis viruses) a 
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crucial helping hand — a theory previ- 
ously postulated by US epidemiologist 
Ernest Drucker. No Congolese physicians 
were trained under colonial rule, and 
when Belgium abandoned the Congo in 
1960, few colonial or missionary doctors 
remained. Pépin describes a society in tur- 
bulent transition, with ‘free women and 
migrants swelling the capital. HIV became 
a mainly sexually transmitted infection. 
After the unsuccessful war of secession 
in Katanga in southeast Congo, Haitians 
among the United Nations troops brought 
HIV to the West. Homosexual men went 
to Haiti for sex, and Luckner Cambronne, 
leader of the country’s Tonton Macoutes 
paramilitary force, sold more than 6,000 
litres of blood plasma a month to the 
United States. 

One question that Pépin skirts is why HIV 
prevalence in Kinshasa since 1980 remained 
relatively stable and low while it exploded 
elsewhere in Africa, spreading widely in 
southern Africa only in the 1990s. That 
mystery drives home the point that where a 
virus first enters the human population isn't 
necessarily where 


it blooms; roots “Where a virus 
are not shoots. first enters 
Another puzzle is the human 
ates a ee population isn’t 
necessarily 
successful com- horett binge 
pared with the Reign ten woe . 
other cross-spe- bs 
shoots. 


cies infections of 
HIV-1 groups N, 
O and P from apes, and of HIV-2 from mon- 
keys. Pépin rightly argues that, apart from 
social factors promoting HIV spread, inher- 
ent properties of the virus must determine 
its fitness to become pandemic. He also 
provides the best analysis I have read of the 
declining HIV-2 epidemic in West Africa. 

AIDS at 30 begins where Pépin leaves 
off, with the appearance of AIDS in the 
United States. Harden, the retired doy- 
enne of medical history at the US National 
Institutes of Health, draws extensively on 
that agency’s archives for her narrative of 
the scientific advances in understanding 
HIV/AIDS, its treatment and prevention. 
She is particularly strong on the challenges 
of formulating US public-health policies 
for AIDS. 

With 34 million people living with HIV, 
besides the 30 million it has already killed, 
seeking to understand the myths and the 
history of AIDS is surely important — 
although not as pressing as developing a 
safe and efficacious HIV vaccine. m 


Robin A. Weiss is professor of viral 
oncology in the Division of Infection & 
Immunity, University College London, UK. 
e-mail: r.weiss@ucl.ac.uk 


D. TREINIS 


Q&A Peter Diamandis 
The eternal optimist 


Peter Diamandis is the founder of the non-profit X Prize Foundation, which aims to kick-start 
research and development to solve humanity’ biggest challenges. On the publication this week of 
his book Abundance, co-authored with journalist Steven Kotler, he explains how technological 
and social progress will enable us to provide enough food, water and energy for all. 


Your book is optimistic about humanity’s 
future. But aren’t we already exceeding our 
planet’s carrying capacity? 

The carrying capacity for Earth is a relative 
number. If I have an orange grove and I can 
reach only the lowest oranges on my trees, 
I need five trees to feed my family. If I can 
build a ladder, then I need only one tree. 
If humanity were to run out of food, our 
‘ladder’ might be genetic engineering, or 
growing food hydroponically inside sky- 
scrapers. It’s not as if water is leaving the 
planet, or energy is not shining down, 
or were not recycling food. These are all 
replenishable resources once we are able to 
use them more efficiently. 


How can we use resources better? 

We're living on a water planet; the challenge 
is that 98% is salt water. But there are tech- 
nologies that can purify it — such as the 
Slingshot, a device the size of a mini refrig- 
erator that can run on cow dung, which the 
Coca-Cola Company is helping to trial in 
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Africa. With food, we 
now have the ability 
to go from evolution 
by natural selection 
to evolution by intel- 
ligent direction using 
genetic engineer- 
ing. We will make cleaner energy through 
solar and nuclear approaches, which are the 
only ones that can scale to meet our needs. 
Mobile-phone use is growing exponentially, 
and soon more than 70% of the world will 
have one; the Qualcomm Tricorder X Prize 
is asking teams to build a mobile app that 
allows users to diagnose themselves as well 
as a physician can. 


How quickly can such technologies develop? 
The world is full of exponential technolo- 
gies. When the Human Genome Project 
started in 1990, people said that it would 
take 50 years and would consume every 
scientist on the planet. These things can 
happen much more quickly. And now 
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there are new forces making them happen. 
Through the X Prize Foundation, I’ve wit- 
nessed individuals and small teams do things 
that, in the past, only governments could do. 
There is the DIY innovator: the person or 
team empowered by extraordinary technol- 
ogy, such as parents of sick children who 
create biolabs in their own kitchens. There 
are the techno-philanthropists: ‘centimil- 
lionaires’ who are being created younger and 
younger and tackling global challenges — 
just as Bill Gates has tackled malaria. In 2010 
there were some 2 billion people online; by 
2020 that is going to rise to 5 billion. These 
people are going to join the global economy, 
and innovate on a zero-cost basis. 


Are things better now than in the past? 
Over the twentieth century, lifespan has 
doubled. We're living in the most peaceful 
time ever. Every generation thinks that their 
problems are the biggest, but eventually we 
get around them. That was an important 
insight for me. People idealize the past but 
they forget how horrible it really was. Seeing 
problems is an evolutionary survival trait. 
The easiest way to survive is to be hyper- 
vigilant for problems. 


What about the growing gap between rich 
and poor? 

That’s immaterial at some point. In the 
United States, happiness correlates with 
income only up to about US$75,000. It’s 
about providing for your needs. If someone 
in Africa can have first-tier [basic] health 
and education and access to abundant 
energy, food and water, they might still have 
little income, but those changes represent a 
huge step forward. 


You write of fulfilling basic needs for free 

and automating menial tasks. How would 
that affect the future economy? 

I don't know. “Work as a defined activity 
in society didn’t exist for the first 100,000 
years of our species. It was invented. If I 
own a nanobot that can create my food, a 
shelter, a car and anything else, I have eve- 
rything I need. But what do you do with all 
your spare time? Does everyone become an 
artist? A thinker? An explorer? It is going to 
be interesting. 


Do you plan to live long enough to find out? 
When I was a first-year medical student I 
saw a television show about some turtles 
that might live as long as 700 years. SoI 
asked, if they can, why can't I? We are on 
the edge of a revolution in health. We're 
designing an organogenesis X Prize for 
spare body parts. I’m staying in touch with 
the smartest researchers and physicians 
Iknow. # 
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Mutant flu: preparing 
for a pandemic 


We at the global humanitarian 
organization Save the Children 
agree that controversy over lab- 
created H5N1 avian influenza 
virus should not detract from 
the larger concern of global 
preparedness for a flu pandemic 
(Nature 482, 131;2012). 

Ina pandemic flu situation, 
when all countries and 
responding organizations are 
stricken, we think it is unrealistic 
to hope that the most resource- 
poor communities around the 
world will receive adequate 
supplies of vaccine, antivirals 
or antibiotics. We believe in 
preparing now so that community 
leaders, and the organizations 
working with them, can mitigate 
the effects of a severe wave of 
flu in the absence of substantial 
outside resources. 

As the World Health 
Organization has noted, non- 
pharmaceutical interventions 
such as quarantine are crucial for 
an effective response, and may 
sometimes be the only means 
of delaying the spread of flu. Yet 
most national plans lack practical 
operational considerations (see 
go.nature.com/mi9sr3). 

Detailed authoritative guidance 
on reducing flu transmission at 
household and community levels, 
and on the home-based care of flu 
patients, in low-resource settings 
is the most important, and needs 
to be published. Support should 
also be provided to governments 
in developing countries to adapt 
this guidance for their settings. 

We believe that such efforts 
should be an urgent priority, and 
are concerned about this apparent 
gap in the most basic level of 
pandemic preparedness. 

Eric S. Starbuck Save the Children, 
Westport, Connecticut, USA. 
estarbuck@savechildren.org 


Mutant flu: assessing 
biosecurity risks 


In the ongoing controversy 
over the mutant H5N1 avian 


influenza research (Nature 

481, 9-10, 2012), we should be 
wary of reducing biosecurity 
measures merely to assigning 
access rights to sensitive 
information and materials. A 
national security body made up 
of military and law-enforcement 
officials that puts confidentiality 
stamps on dual-use research is 
not in the long-term interest of 
scientific progress. 

Biosecurity in research needs 
to be integrated into a more 
comprehensive strategy if it is to 
be effective and avoid harming 
public-health interests. 

Asa member and chair of 
several ethics-review panels 
of dual-use research for the 
European Union, I believe 
that these research projects, 
and their clearly foreseeable 
implications, should have 
undergone a proper risk- 
benefit assessment before 
funding. They could then have 
been modified to accommodate 
additional risk-management 
procedures. 

For example, threats to 
biosecurity could have been 
minimized by developing 
diagnostic kits for early 
detection and surveillance of 
the new genetic variants, and 
by testing possible treatment 
strategies. It seems that none of 
this was done. 

Johannes Rath University of 
Vienna, Austria. 
johannes.rath@univie.ac.at 


Questionable use of 
chimpanzees 


By conducting their experiments 
at US chimpanzee centres, 
foreign scientists have been 
circumventing their own nations’ 
bans on chimpanzee research 
since 2005 (Nature 482, 132; 
2012). Itis important to point out 
that those scientists are almost all 
employed by foreign-based drug 
companies — as reported by a 
US National Institutes of Health 
representative at the Institute of 
Medicine (IOM) public hearing 
in May 2011. 
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The US Food and Drug 
Administration's Office of New 
Drugs reported to the IOM 
committee in June 2011 that 
chimps are never required for 
preclinical drug testing in the 
United States, and that the 
agency discourages the use of 
chimps for this purpose. The 
IOM’s report Chimpanzees 
in Biomedical and Behavioral 
Research, released in December 
2011, also concludes that chimps 
are unnecessary for preclinical 
drug testing. 

The use of chimps for 
preclinical drug trials in 
US centres by foreign drug 
companies is therefore contrary 
to US practice and should be 
banned. 

John J. Pippin Physicians 
Committee for Responsible 
Medicine, Washington DC, USA. 


jpippin@pcrm.org 


Sugar: there’s more 
to the obesity crisis 


To describe sugar as “toxic” 

is extreme, as is its ludicrous 
comparison with alcohol 
(Nature 482, 27-29; 2012). Such 
sensationalism could damage 
the livelihoods of thousands 

of people working in the sugar 
industry worldwide, and will be 
felt in countries such as Australia, 
the United States, Fiji, Mauritius, 
Indonesia and India. 

As the senator for Queensland, 
Australia, where sugar is the 
most significant agricultural 
crop, I wish to voice the 
industry's concerns. Consumers 
should be assured that sugar is 
a safe ingredient and suitable 
for consumption as part of a 
balanced diet. 

Nutritionist Jennie Brand- 
Miller of the University of 
Sydney is not alone in her 
disgust that you published this 
opinion piece (The Australian, 

4 February 2012). The Dietitians 
Association of Australia believes 
that it is simplistic and unhelpful 
to blame sugar alone for the 
obesity crisis. 

Alan Barclay of the Australian 


Diabetes Council notes in the 
same article in The Australian that 
sugar consumption in Australia 
has dropped by 23% since 1980. 
But he adds that during that time, 
the number of overweight or 
obese people has doubled, while 
diabetes has tripled. 

A literature review by 
Australia’s National Health 
Medical Research Council, 
together with its draft dietary 
guidelines of December 2011, 
found that the evidence to 
support advice on added sugar 
and obesity was “limited, 
inconclusive or contradictory”. 

Robert Lustig et al. have 
stimulated debate, yet have 
unnecessarily tarnished 
the image of sugar. There is 
no evidence to suggest that 
reducing sugar consumption 
will halt the rise in obesity. The 
contributing factors are far more 
complex. 

Ron Boswell Brisbane, 
Queensland, Australia. 
senator. boswell@aph.gov.au 


Sugar: fruit fructose 
is still healthy 


Robert Lustig and colleagues 
argue that sugar is “toxic” 
(Nature 482, 27-29; 2012), 
focusing on the “deadly 
effect” of the fructose moiety 
of sucrose. But they are 
directing attention away 
from the problem of general 
overconsumption. 

Guidelines on healthy eating 
encourage fruit consumption, 
and fruit and fruit products 
are the third-largest source of 
fructose in the US diet. 

Our meta-analyses of 
controlled feeding trials 
indicate a net metabolic benefit, 
with no harmful effects, 
from fructose at a level of 
intake obtainable from fruit 
(J. L. Sievenpiper et al. Br. J. 
Nutr., in the press). 

John L. Sievenpiper, Russell J. 
de Souza, David J. A. Jenkins 
St Michael's Hospital, Toronto, 
Ontario, Canada. 
john.sievenpiper@utoronto.ca 


Sugar: a problem of 
developed countries 


The contribution of sugar 
towards chronic disease is more 
relevant to developed countries 
than to the developing world 
(Nature 482, 27-29; 2012). In 
Asia, for example, up to 10% of 
the population is obese and/or 
diabetic (see go.nature.com/ 
qmmoha), even though the 
daily energy contribution from 
sugar is less than 837 kilojoules 
per person. It is more likely 
that a high consumption of 
starch-based foods is to blame 
for this statistic (see go.nature. 
com/2hoimi) . 
Overconsumption of foods that 
have a high glycaemic index (that 
trigger a rapid and sharp increase 
in blood glucose), such as wheat, 
potatoes and certain types of rice, 
also contributes to obesity and. 
diabetes. Emphasis on sugar alone 
is therefore too narrow a basis for 
devising policies to curb these 
problems. 
Christiani Jeyakumar Henry, 
Viren Ranawana Clinical 
Nutrition Research Centre, 
Singapore Institute for Clinical 
Sciences, Singapore. 
jeya_henry@sics.a-star.edu.sg 


Sugar: other ‘toxic’ 
factors play a part 


Regulating products based on 
ascientific risk analysis is a 
worthy goal, but I contend that 
Robert Lustig and colleagues 


oversimplify the “toxic” truth 
about refined carbohydrates 
(Nature 482, 27-29; 2012). 
Rather than demonizing sugar, 
the authors would have better 
served public health with 
recommendations to manage a 
balanced diet with exercise. 

The authors also downplay 
other complex factors that 
could contribute to non- 
communicable disease burdens. 
These include relatively recent 
changes in exercise patterns, and 
pollutants and additives that 
affect metabolic activity. 

Putting sugars in a regulatory 
league with alcohol and tobacco 
is misleading. Sugars do not 
cause behavioural intoxication, 
nor do they have the second- 
hand proximity impact of 
tobacco smoking — key factors 
in their regulation. 

Saleem H. Ali University of 
Vermont, Burlington, USA. 
saleem.ali@uvm.edu 


Australia: small steps 
to control invasives 


We believe that there are more 
obvious and less destructive 
options for controlling gamba 
grass and other invasive weeds in 
Australia than introducing mega- 
herbivores such as elephants 
(Nature 482, 30; 2012). 
Biological control using 
carefully screened host-specific 
arthropods or pathogens, 
combined with quarantine and 
spread-prevention measures, is 


CORRESPONDENCE MRUMIM aN 


| 
oh 


amore balanced approach, and 
one with which Australia has 
considerable experience. 

The world is littered with 
examples of generalist vertebrate 
species (cane toads, foxes, mynas, 
mosquito fish and so on) that 
were introduced in the misguided. 
hope of controlling a pest 
species, only to have a substantial 
undesired impact on native 
biodiversity. 

Credible solutions to these 
problems are more likely to come 
from small things done well, 
rather than through elephantine, 
rhinocerine, or even asinine fixes. 
Bruce L. Webber, John K. Scott, 
Raphael K. Didham CSIRO 
Ecosystem Sciences, Floreat, 
Australia; and University of 
Western Australia, Crawley, 
Australia. 
bruce.webber@csiro.au 


Australia: better 
solutions to wildfires 


Among David Bowmans more 
outlandish suggestions for dealing 
with Australia’s massive problems 
of wildfires, feral animals and 
weeds, there are some workable 
ideas (Nature 482, 30; 2012). 
Some of these are already 
being implemented, such as the 
reinstatement of Aboriginal fire 
management in the north of the 
country. The Australian Wildlife 
Conservancy's prescribed-burn 
programme in the Kimberley 
region is having great success. 
These innovations are radical, 


but they are based on a sound 
ecological understanding. 
Richard J. Hobbs University 
of Western Australia, Crawley, 
Australia. 
richard.hobbs@uwa.edu.au 


Australia: a case for 
Aboriginal rangers 


David Bowman makes a strong 
case for employing Aboriginal 
people to manage their own 
land and to reinstate traditional 
fire practices in Australia 
(Nature 482, 30; 2012). This 
strategy could form the basis 

of a coordinated, long-term 
conservation service. 

It would also provide 
desperately needed employment 
for landowners, as well as 
supplying them with a reliable 
source of protein from hunting 
feral animals (N. Collier et al. 
Hum. Ecol. 39, 155-164; 2011). 

In addition, the Aboriginal 
people, who have a deep spiritual 
connection to land, would be able 
to remain on their traditional 
territories and so maintain close 
functional relationships with 
their ancestors. 

Clive R. McMahon Charles 
Darwin University, Darwin, 
Northern Territory, Australia. 
clive. mcmahon@cdu.edu.au 


Australia: no price 
on cutting fire risk 


David Bowman proposes that 
elephants should be introduced 
into Australia as a cost-effective 
way to control invasive gamba 
grass, a major source of wildfire 
fuel (Nature 482, 30; 2012). But 
managing the elephants could 
be more expensive than, say, 
launching a fleet of harvesters 
every year to reduce fire risk. We 
should start by asking what is 
likely to work best, regardless of 
the cost. 

To combat the problems caused 
by invasive aliens, we should 
implement ecologically sound 
control mechanisms that have a 
reasonable probability of success. 
We can worry about the bill later. 
P.J. Nico de Bruyn, Andrew B. 
Davies University of Pretoria, 
Hatfield, South Africa. 
pjndebruyn@zoology.up.ac.za 
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NEWS & VIEWS 


CELL BIOLOGY 


Collagen secretion explained 


Cells package proteins into vesicles for secretion to the extracellular milieu. A study has now identified an enzyme that 
modifies the packaging machinery to encapsulate unusually large proteins, such as collagen. SEE ARTICLE P.495 


DAVID J. STEPHENS 


ogether with other extracellular proteins, 
| collagen provides the structural frame- 
work on which tissues develop and 
function. It is synthesized in the endoplas- 
mic reticulum, an intracellular organelle, as a 
rigid, rod-like precursor (procollagen) about 
300 nanometres in length. Procollagen — like 
nearly all secreted proteins — is then pack- 
aged into transport vesicles for delivery to 
another organelle, the Golgi apparatus, before 
its secretion to the cell’s surroundings. Trans- 
port vesicles, however, are typically smaller 
than 100 nm, as they are generated from the 
endoplasmic reticulum by a group of proteins 
(the COPII coat) that co-assemble as a struc- 
turally defined polyhedral cage’. On page 495 
of this issue, Jin et al.’ reveal that modification of 
one of the COPII proteins allows the forma- 
tion of vesicles that are large enough to hold 
procollagen. 

The outer layer of the COPII coat is assem- 
bled using structural elements comprised 
of the proteins SEC13 and SEC31 (Fig. 1a). 
Although it was thought that the hinges 
between these elements are flexible enough to 
allow vesicles of various sizes to form*", little 
was known about how vesicle size is controlled. 
Jin and colleagues” show that SEC31 can be 
modified by ubiquitination — the attach- 
ment of one or more copies of a small protein 
called ubiquitin. Although ubiquitination can 
‘mark’ a protein for degradation, it is becoming 
increasingly clear that it can also affect protein 
function’. 

Specifically, the authors” report that, in 
mouse cells, the enzyme CUL3-KLHL12 
adds a single ubiquitin to a small pool of 
SEC31 molecules, and that this modifica- 
tion is required to drive the secretion of 
collagen. Using high-resolution electron 
microscopy, they found that overexpression of 
CUL3-KLHL12 leads to the production 
of large COPII structures, up to 500 nm in 
diameter — sufficient to accommodate pro- 
collagen. The simplest explanation for these 
observations is that ubiquitin attachment to 
SEC31 results in a structural change in the 
COPII cage that alters coat flexibility, and allows 
procollagen to be encapsulated in a nascent 
vesicle (Fig. 1b). 


COPII cage 


at 
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complex 
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Small transmembrane 
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protein 


CUL3-KLHL12 


a. 


Transport vesicle 


60-80 nm 


Membrane Cytosol 


Endoplasmic 
reticulum 
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Figure 1 | Big vesicles for collagen secretion. a, Soluble proteins targeted for secretion, together with 
small transmembrane proteins, are packaged at the endoplasmic reticulum into vesicles that are coated 
by the COPII protein cage. Proteins that will form the inner layer of the COPII coat associate in an 
ordered fashion and then recruit the proteins SEC13 and SEC31, which form the outer layer. This leads 
to membrane deformation and ultimately to scission of 60-80-nm transport vesicles. b, Large proteins 
such as procollagen (the collagen precursor) do not fit into these typical vesicles. Jin et al.’ report that, 
to encapsulate such large cargoes, the enzyme CUL3-KLHL12 attaches one copy of the small protein 
ubiquitin to SEC31 within the SEC13-SEC31 complex, and that this process facilitates collagen export. 
An additional, unknown protein might further stabilize lateral SEC13-SEC31 interactions. Although it 
is not known whether collagen synthesis directly triggers CUL3-KLHL12 activity, the transmembrane 
protein TANGO1 — which couples collagen in the endoplasmic reticulum to the assembling coat on the 


cytosolic face — might have a role in the process. 


Jin and colleagues’ observation that only 
some SEC31 molecules are modified indicates 
strongly that the addition of ubiquitin does not 
directly modulate the mechanics of COPII coat 
assembly. Instead, SEC31 ubiquitination might 
lead to recruitment of an additional, unknown 
protein to perform this role — for example, by 
further stabilizing lateral SEC13-SEC31 inter- 
actions. Identification of the additional factor 
and a more detailed molecular explanation of 
the modified geometry of the vesicle coat are 
challenges for the future. 

Ubiquitination of some SEC31 molecules 
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could be an ongoing process that facilitates 
the formation of large COPII vesicles as a 
routine cell function; alternatively, large vesi- 
cles might be formed only on demand. In the 
latter case, however, it is not immediately 
obvious how CUL3-KLHL12, located in the 
cytoplasm, would sense the presence of newly 
synthesized procollagen in the endoplasmic 
reticulum. A potential candidate for relay- 
ing this information across the endoplasmic 
reticulum membrane is the transmembrane 
protein TANGO1, which forms part of 
a packaging receptor that is essential for 


procollagen secretion®’. TANGO1, however, 
does not make contact with SEC31 directly, 
nor is it found in fully formed vesicles, and so 
its possible connection to CUL3-KLHL12 is 
unclear. 

Other questions remain. Does collagen 
become entirely encapsulated in a large COPII 
cage during vesicle formation (Fig. 1b), or 
does COPII somehow aid collagen export 
indirectly, without the need for a complete 
cage? And how does the addition of ubiqui- 
tin change the geometry of the COPII coat? 
Jin and colleagues’ findings might aid the 
development ofa cell-free system for study- 
ing COPII-dependent packaging of collagen 
that would help to address these issues. 


ASTROPHYSICS 


Moreover, is SEC31 ubiquitination relevant 
to the packaging of other large secreted 
macromolecules, such as lipoproteins? 

These questions are relevant to our 
understanding not only of the fundamental 
mechanisms of cellular secretion, but also of 
diseases in which secretion (particularly of 
collagen) is defective because of gene muta- 
tion®. Furthermore, manipulation of the 
CUL3-KLHL12 ubiquitination pathway 
might be used to increase collagen secre- 
tion from cells for applications in stem-cell 
culture, for growth of tissue components in 
regenerative medicine, or perhaps for amelio- 
rating age-related degeneration of connective 
tissue. 


First results from 
Planck observatory 


Early data from the Planck space satellite provide information about dust in 
distant galaxies, as well as in the Milky Way, and on the properties of gas in some 
of the largest clusters of galaxies in the Universe. 


UROS SELJAK 


stronomers have long known' that most 
Ae the stars in the Universe are born 

in messy environments containing 
dusty clouds. Young stars in such dust- 
enshrouded regions are not visible to optical 
telescopes; thus, multi-wavelength studies, 
from the radio to the X-ray regime, are used to 
better understand how stars form in our Gal- 
axy. But for more distant galaxies, including 


217 GHz 


Figure 1 | The cosmic infrared background. The images show the 
anisotropies, or irregularities, of the cosmic infrared background in 
three of the frequency channels (217 gigahertz, 353 GHz and 857 GHz) 
probed by the Planck observatory’ over a 26° x 26° patch of the sky. 
The anisotropies are visible as globular structures and correspond 


some of the first galaxies in the Universe, such 
dusty expanses are essentially invisible across 
most wavelengths. One exception is the wave- 
lengths in the far-infrared and microwave 
regimes, which are roughly 1,000 times longer 
than those of visible light. Stars heat up the 
dust surrounding them to temperatures of 
roughly 20 kelvin — much lower than that 
of the stars themselves, but nevertheless high 
enough for the dust to radiate microwave and 
far-infrared light. This warm-dust signature, 
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2 billion years old. 
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PWD 


called the cosmic infrared background, has 
now been observed’ by a large team of astrono- 
mers working with data from the Planck space 
observatory. The results are part of a series of 
studies that form a collection of 26 papers, 
published by the Planck team in Astronomy 
& Astrophysics (see go.nature.com/au8vap). 

The Planck satellite’s measurement of the 
cosmic infrared background’ improves on pre- 
vious measurements, including data’ obtained 
by Herschel, a twin observatory to Planck 
launched by the European Space Agency 
aboard the same rocket in 2009. The rocket 
carried them to the Earth-Sun Lagrangian 
point L2 (1.5 million kilometres from Earth in 
the opposite direction from the Sun), where the 
satellites can be stationary relative to both the 
Sun and Earth, allowing for shielding from the 
Sun’s radiation. 

Planck detects microwave light in several 
wavelength bands in which the warm-dust 
emission can be observed (Fig. 1). Because 
the Universe is expanding and the wavelength 
of light stretches with the expansion, the light 
that we observe has a longer wavelength than it 
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to dusty galaxies clumped together on large scales. As we move across 
frequency channels, different epochs of cosmic time become visible: 
observations at 217 GHz offer a glimpse of some of the oldest galaxies 
in the Universe, which formed when the Universe was less than 
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procollagen secretion®’. TANGO1, however, 
does not make contact with SEC31 directly, 
nor is it found in fully formed vesicles, and so 
its possible connection to CUL3-KLHL12 is 
unclear. 

Other questions remain. Does collagen 
become entirely encapsulated in a large COPII 
cage during vesicle formation (Fig. 1b), or 
does COPII somehow aid collagen export 
indirectly, without the need for a complete 
cage? And how does the addition of ubiqui- 
tin change the geometry of the COPII coat? 
Jin and colleagues’ findings might aid the 
development ofa cell-free system for study- 
ing COPII-dependent packaging of collagen 
that would help to address these issues. 
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Moreover, is SEC31 ubiquitination relevant 
to the packaging of other large secreted 
macromolecules, such as lipoproteins? 

These questions are relevant to our 
understanding not only of the fundamental 
mechanisms of cellular secretion, but also of 
diseases in which secretion (particularly of 
collagen) is defective because of gene muta- 
tion®. Furthermore, manipulation of the 
CUL3-KLHL12 ubiquitination pathway 
might be used to increase collagen secre- 
tion from cells for applications in stem-cell 
culture, for growth of tissue components in 
regenerative medicine, or perhaps for amelio- 
rating age-related degeneration of connective 
tissue. 


First results from 
Planck observatory 


Early data from the Planck space satellite provide information about dust in 
distant galaxies, as well as in the Milky Way, and on the properties of gas in some 
of the largest clusters of galaxies in the Universe. 
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stronomers have long known' that most 
Ae the stars in the Universe are born 

in messy environments containing 
dusty clouds. Young stars in such dust- 
enshrouded regions are not visible to optical 
telescopes; thus, multi-wavelength studies, 
from the radio to the X-ray regime, are used to 
better understand how stars form in our Gal- 
axy. But for more distant galaxies, including 


217 GHz 


Figure 1 | The cosmic infrared background. The images show the 
anisotropies, or irregularities, of the cosmic infrared background in 
three of the frequency channels (217 gigahertz, 353 GHz and 857 GHz) 
probed by the Planck observatory’ over a 26° x 26° patch of the sky. 
The anisotropies are visible as globular structures and correspond 


some of the first galaxies in the Universe, such 
dusty expanses are essentially invisible across 
most wavelengths. One exception is the wave- 
lengths in the far-infrared and microwave 
regimes, which are roughly 1,000 times longer 
than those of visible light. Stars heat up the 
dust surrounding them to temperatures of 
roughly 20 kelvin — much lower than that 
of the stars themselves, but nevertheless high 
enough for the dust to radiate microwave and 
far-infrared light. This warm-dust signature, 
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2 billion years old. 
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called the cosmic infrared background, has 
now been observed’ by a large team of astrono- 
mers working with data from the Planck space 
observatory. The results are part of a series of 
studies that form a collection of 26 papers, 
published by the Planck team in Astronomy 
& Astrophysics (see go.nature.com/au8vap). 

The Planck satellite’s measurement of the 
cosmic infrared background’ improves on pre- 
vious measurements, including data’ obtained 
by Herschel, a twin observatory to Planck 
launched by the European Space Agency 
aboard the same rocket in 2009. The rocket 
carried them to the Earth-Sun Lagrangian 
point L2 (1.5 million kilometres from Earth in 
the opposite direction from the Sun), where the 
satellites can be stationary relative to both the 
Sun and Earth, allowing for shielding from the 
Sun’s radiation. 

Planck detects microwave light in several 
wavelength bands in which the warm-dust 
emission can be observed (Fig. 1). Because 
the Universe is expanding and the wavelength 
of light stretches with the expansion, the light 
that we observe has a longer wavelength than it 
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to dusty galaxies clumped together on large scales. As we move across 
frequency channels, different epochs of cosmic time become visible: 
observations at 217 GHz offer a glimpse of some of the oldest galaxies 
in the Universe, which formed when the Universe was less than 
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Mist opportunities 


From the fixative properties of hairsprays to 
the stickiness of filaments on beetles’ feet, 
the wetting of flexible fibres with droplets 
of liquid is a universal phenomenon — but 
one we know surprisingly little about. On 
page 510 of this issue, Duprat et a/. formulate 
rules to describe how mists of droplets 
interact with flexible fibre arrays (C. Duprat, 
S. Protiére, A. Y. Beebe & H. A. Stone Nature 
482, 510-513; 2012). 

The researchers began with the 
simplest possible model: the interactions 
of water droplets with a pair of closely 
aligned, flexible glass fibres that were 
clamped at one end but free to bend at 
the other. They observed that a droplet 
deposited close to the clamped ends adopts 
one of three forms: it could remain as a 
tight, spherical bridge between the filaments, 
or, depending on the conditions, it could 
either partially or completely spread along 
the fibres, in the latter case causing them 
to coalesce. 

On further investigation, Duprat et a/. found 


had at its source. This means that, for the same 
dust temperature, observing the dust emis- 
sion at the longer wavelengths corresponds 
to observing an epoch when the Universe was 
smaller, and hence younger. By measuring the 
dust at different wavelengths, Planck can track 
the emission from star-forming galaxies as a 
function of cosmic time. Planck’s observations” 
suggest that most of the emission in the longer- 
wavelength bands comes from galaxies that 
formed at a time when the Universe was less 
than 2 billion years old (the age of the Universe 
today is approximately 14 billion years). 

To achieve this measurement, the Planck 
team performed’ a sophisticated software 
analysis called component separation. This 
was required because these wavelength bands 
contain radiation from many other sources, 
mostly the Milky Way, but also the cosmic 
microwave background (CMB, relic radia- 
tion from the early Universe glowing at 2.7 K). 
The strengths of these sources vary differ- 
ently as a function of wavelength. By com- 
bining Planck's nine wavelength bands with 
additional external measurements, the team 
was able to separate the cosmic-infrared- 
background component from the other sources 
of radiation. The authors found’ a broad 
agreement in results between different areas 
in the sky, which had been specially chosen 
for having low radiation from our Galaxy, 
suggesting that the component separation was 
successful. 

The emission from the Milky Way is not just 


that six physical parameters 
control droplet shape and 
spreading; they include 
fibre geometry, the distance 
between the fibres, and the 
fibres’ mechanical properties. 
The authors also identified 
a critical droplet volume 
above which fibres do not 
coalesce, and a second critical 
volume at which droplet capture by fibres is 
maximized. 

The team went on to explore the 
wetting of a natural fibre array by spraying a 
goose feather with oil droplets and observing 
the effects on the barbules (filaments 
projecting from each barb of a feather). They 
found that their theoretical model held up — 
small droplets spread along the barbules 
and caused barbule clumping, whereas 
larger droplets did not spread and could 
be easily dislodged — despite the 
roughness of the feather’s barbules and 
the chemical affinity between the droplet 


a contaminant of the cosmic infrared back- 
ground; it also contains some surprises of its 
own. One of these relates to the ‘anomalous 
microwave emission at centimetre wavelengths. 
This has been known about for a few years, but 
its origin has been controversial. In particular, 
although this radiation has been observed* to 
correlate with the emission from small dust 
grains in the Galaxy, simple models of thermal 
emission from dust could not explain its wave- 
length dependence. However, if the dust parti- 
cles are spinning at high rates, they can radiate 
at a wavelength that relates to their spinning fre- 
quency and size. In this spinning-dust model, 
the emission occurs over a relatively narrow 
range of wavelengths that happens to coincide 
with the longest-wavelength band of the Planck 
observatory. Planck's observations of emission 
from the Milky Way provide’ strong support 
for the spinning-dust model. 

Not all of the results from Planck are 
related to dust radiation. Light propagating 
through hot gas can be scattered off electrons 
zooming around these high-temperature 
regions. The result of this process, named the 
Sunyaev—Zeldovich (SZ) effect after the two 
Russian scientists who first proposed’ it, is that 
longer-wavelength light is shifted to shorter 
wavelengths. When viewed against the back- 
ground provided by the CMB radiation, this 
effect leads to a dark hole at longer wavelengths 
at the position ofa gas clump on the sky. Simi- 
larly, it causes a bright peak of light at shorter 
wavelengths at the same position. 
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and the barbules’ surfaces. 

Duprat and colleagues’ discoveries 
suggest that the mechanical properties 
and spatial organization of biological 
fibre arrays may have evolved to optimize 
interactions with liquid droplets, and so 
enhance functions such as adhesion, dew 
collection and self-cleaning. The work 
also offers opportunities for improving 
the performance of technological wetting 
systems — for example, droplet volumes 
in sprays could be engineered to fine-tune 
their wetting interactions with relevant 
fibres. Rosamund Daw 


With Planck’s many wavelength bands, both 
of these features can be observed, leading to 
a convincing detection of the SZ effect. The 
sources most likely to provide a detectable SZ 
signal are the most massive galaxy clusters, 
which contain huge amounts of some of the 
hottest gas in the Universe. The Planck team 
found’ nearly 200 cluster candidates with this 
technique, of which about 20 were previously 
unknown. Most of these have subsequently 
been confirmed as real clusters by follow-up 
studies, including X-ray observations® with the 
XMM-Newton satellite. Combined analysis of 
these data provides detailed information about 
the gas density and temperature distribution in 
the clusters, resulting in a better understanding 
of the processes that led to their formation. 

These new results’ demonstrate that it is 
possible to find clusters of galaxies with the 
SZ technique even for surveys looking at the 
entire sky, in contrast to previous SZ detec- 
tions — by the South Pole Telescope’ and 
Atacama Cosmology Telescope’ — that 
searched smaller patches of the sky. Ulti- 
mately, the SZ method will allow clusters to be 
observed at a much larger distance from Earth 
than is possible with other methods, such as 
X-ray emission. One exciting application of 
the SZ approach would be to probe the growth 
of the largest (and thus rarest) structures at 
early times. Such observations would provide 
a measurement of the different components 
that make up the Universe and of the size of 
the initial density fluctuations that eventually 
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grew to become galaxies and galaxy clusters. 
The early results from Planck demonstrate 
that the observatory is working flawlessly, and 
provide a first glimpse of its scientific poten- 
tial. However, the best is yet to come. The main 
mission of Planck is to map the CMB radia- 
tion and its polarization with unprecedented 
precision. This measurement will provide a 
window onto the early Universe and offer clues 
as to what created the first seeds of structure. 
Planck may also detect the relic gravity waves 
from the Big Bang through the observations 
of CMB polarization. The task is complicated 
by the relative faintness of the CMB com- 
pared with other sources of radiation, such 
as dust emission, in most of the wavelength 


MATERIALS SCIENCE 


bands. Careful separation of components is 
thus needed to isolate the CMB signal, a task 
that has proved challenging and is the main 
reason that these early results do not include 
any primary CMB data. These CMB results 
are expected to be announced in early 2013. 
Given the spectacular instrument performance 
of Planck shown by its early findings”””, the 
cosmology community is eagerly awaiting 
more results. m 
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Cell environments 
programmed with light 


A combination of two light-induced reactions has been used to attach peptides to 
a polymeric gel, and then to detach them from it. This feat opens up opportunities 
for studying the effects of signalling molecules on cell behaviour in vitro. 
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These models have been generated from 
crosslinked networks of protein components 
of the ECM such as collagen, or from ECM 
glycoproteins (polypeptides that have sugars 
attached) such as laminin. They have provided 
vital insight into extrinsic cell regulation, and 
in some cases have even made possible the for- 
mation of entire tissues from single stem cells 
in vitro’. Unfortunately, these biomatrices tend 
to suffer from uncontrollable batch-to-batch 
variability and are unable to modulate the 
availability of extrinsic signalling molecules 


MATTHIAS P. LUTOLF 


he ability to use light to precisely control 

the activity of cells has transformed the 

way many experiments in biology are 
performed. In particular, optogenetic tech- 
niques — in which light is used to manipulate 
cells that have been genetically engineered 
to be light responsive — have revolutionized 
neuroscience by providing a completely new 
way to modulate cell signalling, even in live 
animals’. Writing in Angewandte Chemie, 
DeForest and Anseth’ report that light can 
be used to dynamically manipulate not only 
the intrinsic cellular regulatory machinery, 
but also the external microenvironment of 
a cell. Specifically, they showcase a class of 
‘optobiomaterial’ whose biochemical proper- 
ties can be changed to influence cellular activ- 
ity simply by having different sources of light 
shone on it. 

Far from being intrinsically determined, 
cell behaviour such as proliferation, differen- 
tiation and migration are tightly regulated by 
spatio-temporally complex signals originating 
from the surrounding milieu (the extracellular 
matrix, ECM). For instance, the micro- 
environments (known as niches) surrounding 
rare adult stem cells in human tissues regulate 
stem-cell behaviour using a combination of 
local cell-cell interactions, ECM-derived sig- 
nals and soluble signalling molecules. Together, 
these niche signals are crucial for ensuring life- 
long maintenance of stem-cell function*. An 


understanding of how stem cells respond to 
signals from their extracellular environment is 
therefore essential, especially for realizing the 
therapeutic potential of stem cells. 

Biologists have a variety of in vitro model 
systems at hand to study such complex cell- 
ECM interactions, and this enables them to 
uncover cell-signalling mechanisms in near- 
physiological, three-dimensional contexts. 


and thus cell function — controllably in 
space and time. 

To recreate the dynamics of cellular micro- 
environments in three dimensions, researchers 
have sought strategies in materials chemistry 
that permit the biophysical and biochemi- 
cal properties of matrices to be selectively 
modulated in a tailor-made fashion. Most 
approaches rely on well-characterized, cross- 
linked, synthetic polymers known as hydro- 
gels that have ECM-like biophysical properties. 


Figure 1 | Reversible gel patterning. DeForest and Anseth’ have prepared hydrogels — water-absorbent 
polymeric networks — to which biologically active molecules can be attached and then removed 

using two light-induced reactions. By focusing light on specific regions of the gel, the authors precisely 
controlled the points of attachment. a, In this three-dimensional section of a hydrogel, fluorescently 
labelled peptides are bound in a double-helix pattern that was traced out using focused, visible laser light. 
False colour has been used to aid visualization. b, Subsequent irradiation of the red part of the helix with 
ultraviolet light has caused the peptides in that region to detach. Scale bars, 200 micrometres. (Images 


reproduced from ref. 2.) 
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Mist opportunities 


From the fixative properties of hairsprays to 
the stickiness of filaments on beetles’ feet, 
the wetting of flexible fibres with droplets 
of liquid is a universal phenomenon — but 
one we know surprisingly little about. On 
page 510 of this issue, Duprat et a/. formulate 
rules to describe how mists of droplets 
interact with flexible fibre arrays (C. Duprat, 
S. Protiére, A. Y. Beebe & H. A. Stone Nature 
482, 510-513; 2012). 

The researchers began with the 
simplest possible model: the interactions 
of water droplets with a pair of closely 
aligned, flexible glass fibres that were 
clamped at one end but free to bend at 
the other. They observed that a droplet 
deposited close to the clamped ends adopts 
one of three forms: it could remain as a 
tight, spherical bridge between the filaments, 
or, depending on the conditions, it could 
either partially or completely spread along 
the fibres, in the latter case causing them 
to coalesce. 

On further investigation, Duprat et a/. found 


had at its source. This means that, for the same 
dust temperature, observing the dust emis- 
sion at the longer wavelengths corresponds 
to observing an epoch when the Universe was 
smaller, and hence younger. By measuring the 
dust at different wavelengths, Planck can track 
the emission from star-forming galaxies as a 
function of cosmic time. Planck’s observations” 
suggest that most of the emission in the longer- 
wavelength bands comes from galaxies that 
formed at a time when the Universe was less 
than 2 billion years old (the age of the Universe 
today is approximately 14 billion years). 

To achieve this measurement, the Planck 
team performed’ a sophisticated software 
analysis called component separation. This 
was required because these wavelength bands 
contain radiation from many other sources, 
mostly the Milky Way, but also the cosmic 
microwave background (CMB, relic radia- 
tion from the early Universe glowing at 2.7 K). 
The strengths of these sources vary differ- 
ently as a function of wavelength. By com- 
bining Planck's nine wavelength bands with 
additional external measurements, the team 
was able to separate the cosmic-infrared- 
background component from the other sources 
of radiation. The authors found’ a broad 
agreement in results between different areas 
in the sky, which had been specially chosen 
for having low radiation from our Galaxy, 
suggesting that the component separation was 
successful. 

The emission from the Milky Way is not just 


that six physical parameters 
control droplet shape and 
spreading; they include 
fibre geometry, the distance 
between the fibres, and the 
fibres’ mechanical properties. 
The authors also identified 
a critical droplet volume 
above which fibres do not 
coalesce, and a second critical 
volume at which droplet capture by fibres is 
maximized. 

The team went on to explore the 
wetting of a natural fibre array by spraying a 
goose feather with oil droplets and observing 
the effects on the barbules (filaments 
projecting from each barb of a feather). They 
found that their theoretical model held up — 
small droplets spread along the barbules 
and caused barbule clumping, whereas 
larger droplets did not spread and could 
be easily dislodged — despite the 
roughness of the feather’s barbules and 
the chemical affinity between the droplet 


a contaminant of the cosmic infrared back- 
ground; it also contains some surprises of its 
own. One of these relates to the ‘anomalous 
microwave emission at centimetre wavelengths. 
This has been known about for a few years, but 
its origin has been controversial. In particular, 
although this radiation has been observed* to 
correlate with the emission from small dust 
grains in the Galaxy, simple models of thermal 
emission from dust could not explain its wave- 
length dependence. However, if the dust parti- 
cles are spinning at high rates, they can radiate 
at a wavelength that relates to their spinning fre- 
quency and size. In this spinning-dust model, 
the emission occurs over a relatively narrow 
range of wavelengths that happens to coincide 
with the longest-wavelength band of the Planck 
observatory. Planck's observations of emission 
from the Milky Way provide’ strong support 
for the spinning-dust model. 

Not all of the results from Planck are 
related to dust radiation. Light propagating 
through hot gas can be scattered off electrons 
zooming around these high-temperature 
regions. The result of this process, named the 
Sunyaev—Zeldovich (SZ) effect after the two 
Russian scientists who first proposed’ it, is that 
longer-wavelength light is shifted to shorter 
wavelengths. When viewed against the back- 
ground provided by the CMB radiation, this 
effect leads to a dark hole at longer wavelengths 
at the position ofa gas clump on the sky. Simi- 
larly, it causes a bright peak of light at shorter 
wavelengths at the same position. 
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and the barbules’ surfaces. 

Duprat and colleagues’ discoveries 
suggest that the mechanical properties 
and spatial organization of biological 
fibre arrays may have evolved to optimize 
interactions with liquid droplets, and so 
enhance functions such as adhesion, dew 
collection and self-cleaning. The work 
also offers opportunities for improving 
the performance of technological wetting 
systems — for example, droplet volumes 
in sprays could be engineered to fine-tune 
their wetting interactions with relevant 
fibres. Rosamund Daw 


With Planck’s many wavelength bands, both 
of these features can be observed, leading to 
a convincing detection of the SZ effect. The 
sources most likely to provide a detectable SZ 
signal are the most massive galaxy clusters, 
which contain huge amounts of some of the 
hottest gas in the Universe. The Planck team 
found’ nearly 200 cluster candidates with this 
technique, of which about 20 were previously 
unknown. Most of these have subsequently 
been confirmed as real clusters by follow-up 
studies, including X-ray observations® with the 
XMM-Newton satellite. Combined analysis of 
these data provides detailed information about 
the gas density and temperature distribution in 
the clusters, resulting in a better understanding 
of the processes that led to their formation. 

These new results’ demonstrate that it is 
possible to find clusters of galaxies with the 
SZ technique even for surveys looking at the 
entire sky, in contrast to previous SZ detec- 
tions — by the South Pole Telescope’ and 
Atacama Cosmology Telescope’ — that 
searched smaller patches of the sky. Ulti- 
mately, the SZ method will allow clusters to be 
observed at a much larger distance from Earth 
than is possible with other methods, such as 
X-ray emission. One exciting application of 
the SZ approach would be to probe the growth 
of the largest (and thus rarest) structures at 
early times. Such observations would provide 
a measurement of the different components 
that make up the Universe and of the size of 
the initial density fluctuations that eventually 
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grew to become galaxies and galaxy clusters. 
The early results from Planck demonstrate 
that the observatory is working flawlessly, and 
provide a first glimpse of its scientific poten- 
tial. However, the best is yet to come. The main 
mission of Planck is to map the CMB radia- 
tion and its polarization with unprecedented 
precision. This measurement will provide a 
window onto the early Universe and offer clues 
as to what created the first seeds of structure. 
Planck may also detect the relic gravity waves 
from the Big Bang through the observations 
of CMB polarization. The task is complicated 
by the relative faintness of the CMB com- 
pared with other sources of radiation, such 
as dust emission, in most of the wavelength 


MATERIALS SCIENCE 


bands. Careful separation of components is 
thus needed to isolate the CMB signal, a task 
that has proved challenging and is the main 
reason that these early results do not include 
any primary CMB data. These CMB results 
are expected to be announced in early 2013. 
Given the spectacular instrument performance 
of Planck shown by its early findings”””, the 
cosmology community is eagerly awaiting 
more results. m 
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Cell environments 
programmed with light 


A combination of two light-induced reactions has been used to attach peptides to 
a polymeric gel, and then to detach them from it. This feat opens up opportunities 
for studying the effects of signalling molecules on cell behaviour in vitro. 
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These models have been generated from 
crosslinked networks of protein components 
of the ECM such as collagen, or from ECM 
glycoproteins (polypeptides that have sugars 
attached) such as laminin. They have provided 
vital insight into extrinsic cell regulation, and 
in some cases have even made possible the for- 
mation of entire tissues from single stem cells 
in vitro’. Unfortunately, these biomatrices tend 
to suffer from uncontrollable batch-to-batch 
variability and are unable to modulate the 
availability of extrinsic signalling molecules 
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he ability to use light to precisely control 

the activity of cells has transformed the 

way many experiments in biology are 
performed. In particular, optogenetic tech- 
niques — in which light is used to manipulate 
cells that have been genetically engineered 
to be light responsive — have revolutionized 
neuroscience by providing a completely new 
way to modulate cell signalling, even in live 
animals’. Writing in Angewandte Chemie, 
DeForest and Anseth’ report that light can 
be used to dynamically manipulate not only 
the intrinsic cellular regulatory machinery, 
but also the external microenvironment of 
a cell. Specifically, they showcase a class of 
‘optobiomaterial’ whose biochemical proper- 
ties can be changed to influence cellular activ- 
ity simply by having different sources of light 
shone on it. 

Far from being intrinsically determined, 
cell behaviour such as proliferation, differen- 
tiation and migration are tightly regulated by 
spatio-temporally complex signals originating 
from the surrounding milieu (the extracellular 
matrix, ECM). For instance, the micro- 
environments (known as niches) surrounding 
rare adult stem cells in human tissues regulate 
stem-cell behaviour using a combination of 
local cell-cell interactions, ECM-derived sig- 
nals and soluble signalling molecules. Together, 
these niche signals are crucial for ensuring life- 
long maintenance of stem-cell function*. An 


understanding of how stem cells respond to 
signals from their extracellular environment is 
therefore essential, especially for realizing the 
therapeutic potential of stem cells. 

Biologists have a variety of in vitro model 
systems at hand to study such complex cell- 
ECM interactions, and this enables them to 
uncover cell-signalling mechanisms in near- 
physiological, three-dimensional contexts. 


and thus cell function — controllably in 
space and time. 

To recreate the dynamics of cellular micro- 
environments in three dimensions, researchers 
have sought strategies in materials chemistry 
that permit the biophysical and biochemi- 
cal properties of matrices to be selectively 
modulated in a tailor-made fashion. Most 
approaches rely on well-characterized, cross- 
linked, synthetic polymers known as hydro- 
gels that have ECM-like biophysical properties. 


Figure 1 | Reversible gel patterning. DeForest and Anseth’ have prepared hydrogels — water-absorbent 
polymeric networks — to which biologically active molecules can be attached and then removed 

using two light-induced reactions. By focusing light on specific regions of the gel, the authors precisely 
controlled the points of attachment. a, In this three-dimensional section of a hydrogel, fluorescently 
labelled peptides are bound in a double-helix pattern that was traced out using focused, visible laser light. 
False colour has been used to aid visualization. b, Subsequent irradiation of the red part of the helix with 
ultraviolet light has caused the peptides in that region to detach. Scale bars, 200 micrometres. (Images 


reproduced from ref. 2.) 
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But, in contrast to ECMs, most synthetic 
hydrogels are biologically inert because their 
polymer ‘backbones’ contain no biologically 
active components. This allows researchers 
to design very ‘clean’ experimental systems: 
biologically active molecules can be attached 
to hydrogels to perturb cell activity in a well- 
controlled fashion, without interference from 
the hydrogel itself. 

Several research groups have carried out 
work in which light-sensitive molecular 
building blocks were attached to hydrogel 
networks to generate artificial ECMs in which 
the properties of microenvironments could be 
specifically modulated by light exposure’. For 
example, the introduction of chemical groups 
that can be cleaved by ultraviolet light has led 
to hydrogels that soften on light exposure’. 
Conversely, the incorporation of groups that 
form crosslinks between polymer chains when 
irradiated with ultraviolet light has resulted in 
materials that stiffen upon such irradiation’. 

Systems in which light triggers the 
coupling*” or removal’ of biologically active 
molecules to or from hydrogel polymer net- 
works have also been devised. These light- 
mediated approaches to modifying hydrogels 
have been used to control some aspects of the 
basic three-dimensional behaviour of cells 
embedded in the materials, such as adhesion 
to the artificial ECM or migration. But because 
the modifications involved are irreversible, 
they allow only one-way manipulation of 
cell activity. 

DeForest and Anseth’s work” now dem- 
onstrates fully reversible modulation of 
biologically active building blocks within 
light-sensitive hydrogels. They synthesized 
small peptides that can act as signals for cell 
adhesion, to which a short linker section 
was attached. The free end of the linker was 
a chemical group that can react with alkene 
groups in a hydrogel when irradiated with 
visible light, thereby attaching the peptide 
to the gel (Fig. 1). Another part of the linker 
was a group that breaks apart when irradiated 
with ultraviolet light; by shining this light on 
a hydrogel that had been decorated with the 
peptides, the authors could detach the peptides 
from the gel. 

Crucially, both light-activated reactions are 
cell-compatible, which allowed DeForest and 
Anseth to attach (or detach) the peptides to (or 
from) their hydrogel in the presence of mouse 
embryonic fibroblast cells. By controlling 
precisely when and where the cell-adhesive 
peptides bound in the gel, the authors could 
control the duration and locations in which the 
cells attached and spread. 

In a first gel-patterning step, DeForest 
and Anseth used visible light to create small 
‘islands’ of peptides to which fibroblasts grown 
in culture with the gel adhered. In a second 
step, conducted after one day of culture, the 
authors removed peptides from areas of the 
islands using ultraviolet light. This caused 


rapid, selective detachment of cells from those 
areas. The authors showed that the removed 
cells could then be grown again in culture, 
or analysed in other assays. As DeForest and 
Anseth suggest’, this kind of protocol could be 
widely used to manipulate and study subsets 
of cells (or even individual cells) of larger cell 
populations. 

One long-term goal of work such as this 
is the development of materials to act as 
scaffolds for tissue regeneration. Can we 
expect this and/or similar techniques to trans- 
form tissue engineering in the same way that 
optogenetics is transforming neuroscience? 
This is, of course, difficult to predict. For 
DeForest and Anseth’s hydrogel to be fully 
physiologically relevant, the ability to attach 
and release full-length proteins’ — rather than 
short peptides — to the material needs to be 
developed. And it remains to be seen whether 
their approach is directly translatable to tissue 
regeneration in vivo. Furthermore, it could 
be argued that these methods will be valuable 
for tissue regeneration in only a relatively few 
cases, such as those in which much simpler 
scaffolds fail, because the spatial arrange- 
ment of ECM signals is necessary for driving 
regeneration. 

Nevertheless, the reversible, dynamic control 
of chemical and physical gel properties should 
allow previously impossible experiments to 
be performed in cell culture. For example, it 
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might be used to investigate how individual 
stem cells differentiate or renew themselves in 
response to changes in signals from an arti- 
ficial microenvironment that spatially resem- 
bles natural stem-cell niches. Alternatively, 
three-dimensional environments for stem cells 
could be made in which the display or release 
of molecular signals is graded, to mimic pro- 
cesses that occur during the embryonic devel- 
opment of an organism. DeForest and Anseth’s 
optobiomaterials therefore represent a major 
contribution to a nascent field in stem-cell 
bioengineering. m 
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A topological route to 
error correction 


Quantum computing is plagued by noise and small errors. An approach based on 
topological techniques reduces the sensitivity to errors and boosts the prospects 
for building practical quantum computers. SEE ARTICLE P.489 


JAMES D. FRANSON 


uantum computers have the potential 

to solve numerical problems that would 

be impossible on a classical computer. 
Roughly speaking, the superposition princi- 
ple of quantum mechanics allows a quantum 
computer to perform many calculations 
simultaneously on a single processor, and 
entanglement (non-classical correlations) 
provides an exponential increase in its 
memory capacity. Unfortunately, the same 
properties that enhance the computational 
power of a quantum computer also make it 
sensitive to errors produced by interactions 
with the environment or by imperfect logic 
operations. In this issue, Yao et al.’ (page 489) 
describe the first experimental demonstration 
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of a technique that uses topological effects to 
reduce the sensitivity of a quantum computer 
to errors. 

The bits in a quantum computer, commonly 
referred to as qubits, can be represented by a 
two-state quantum system, such as the two 
quantized energy levels of an atom (Fig. 1a). 
One state represents a logical value of ‘0’ and 
the other state represents a ‘1’ — meaning that, 
like a classical computer, a quantum computer 
is a digital device. But unlike classical phys- 
ics, quantum mechanics allows situations in 
which both possibilities (0 or 1) exist simul- 
taneously. The probability of finding the sys- 
tem in the 0 or 1 state is equal to the square of 
a complex number known as the probability 
amplitude. Asa result, the information stored 
ina qubit corresponds to a continuous range of 
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But, in contrast to ECMs, most synthetic 
hydrogels are biologically inert because their 
polymer ‘backbones’ contain no biologically 
active components. This allows researchers 
to design very ‘clean’ experimental systems: 
biologically active molecules can be attached 
to hydrogels to perturb cell activity in a well- 
controlled fashion, without interference from 
the hydrogel itself. 

Several research groups have carried out 
work in which light-sensitive molecular 
building blocks were attached to hydrogel 
networks to generate artificial ECMs in which 
the properties of microenvironments could be 
specifically modulated by light exposure’. For 
example, the introduction of chemical groups 
that can be cleaved by ultraviolet light has led 
to hydrogels that soften on light exposure’. 
Conversely, the incorporation of groups that 
form crosslinks between polymer chains when 
irradiated with ultraviolet light has resulted in 
materials that stiffen upon such irradiation’. 

Systems in which light triggers the 
coupling*” or removal’ of biologically active 
molecules to or from hydrogel polymer net- 
works have also been devised. These light- 
mediated approaches to modifying hydrogels 
have been used to control some aspects of the 
basic three-dimensional behaviour of cells 
embedded in the materials, such as adhesion 
to the artificial ECM or migration. But because 
the modifications involved are irreversible, 
they allow only one-way manipulation of 
cell activity. 

DeForest and Anseth’s work” now dem- 
onstrates fully reversible modulation of 
biologically active building blocks within 
light-sensitive hydrogels. They synthesized 
small peptides that can act as signals for cell 
adhesion, to which a short linker section 
was attached. The free end of the linker was 
a chemical group that can react with alkene 
groups in a hydrogel when irradiated with 
visible light, thereby attaching the peptide 
to the gel (Fig. 1). Another part of the linker 
was a group that breaks apart when irradiated 
with ultraviolet light; by shining this light on 
a hydrogel that had been decorated with the 
peptides, the authors could detach the peptides 
from the gel. 

Crucially, both light-activated reactions are 
cell-compatible, which allowed DeForest and 
Anseth to attach (or detach) the peptides to (or 
from) their hydrogel in the presence of mouse 
embryonic fibroblast cells. By controlling 
precisely when and where the cell-adhesive 
peptides bound in the gel, the authors could 
control the duration and locations in which the 
cells attached and spread. 

In a first gel-patterning step, DeForest 
and Anseth used visible light to create small 
‘islands’ of peptides to which fibroblasts grown 
in culture with the gel adhered. In a second 
step, conducted after one day of culture, the 
authors removed peptides from areas of the 
islands using ultraviolet light. This caused 


rapid, selective detachment of cells from those 
areas. The authors showed that the removed 
cells could then be grown again in culture, 
or analysed in other assays. As DeForest and 
Anseth suggest’, this kind of protocol could be 
widely used to manipulate and study subsets 
of cells (or even individual cells) of larger cell 
populations. 

One long-term goal of work such as this 
is the development of materials to act as 
scaffolds for tissue regeneration. Can we 
expect this and/or similar techniques to trans- 
form tissue engineering in the same way that 
optogenetics is transforming neuroscience? 
This is, of course, difficult to predict. For 
DeForest and Anseth’s hydrogel to be fully 
physiologically relevant, the ability to attach 
and release full-length proteins’ — rather than 
short peptides — to the material needs to be 
developed. And it remains to be seen whether 
their approach is directly translatable to tissue 
regeneration in vivo. Furthermore, it could 
be argued that these methods will be valuable 
for tissue regeneration in only a relatively few 
cases, such as those in which much simpler 
scaffolds fail, because the spatial arrange- 
ment of ECM signals is necessary for driving 
regeneration. 

Nevertheless, the reversible, dynamic control 
of chemical and physical gel properties should 
allow previously impossible experiments to 
be performed in cell culture. For example, it 
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might be used to investigate how individual 
stem cells differentiate or renew themselves in 
response to changes in signals from an arti- 
ficial microenvironment that spatially resem- 
bles natural stem-cell niches. Alternatively, 
three-dimensional environments for stem cells 
could be made in which the display or release 
of molecular signals is graded, to mimic pro- 
cesses that occur during the embryonic devel- 
opment of an organism. DeForest and Anseth’s 
optobiomaterials therefore represent a major 
contribution to a nascent field in stem-cell 
bioengineering. m 
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A topological route to 
error correction 


Quantum computing is plagued by noise and small errors. An approach based on 
topological techniques reduces the sensitivity to errors and boosts the prospects 
for building practical quantum computers. SEE ARTICLE P.489 
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uantum computers have the potential 

to solve numerical problems that would 

be impossible on a classical computer. 
Roughly speaking, the superposition princi- 
ple of quantum mechanics allows a quantum 
computer to perform many calculations 
simultaneously on a single processor, and 
entanglement (non-classical correlations) 
provides an exponential increase in its 
memory capacity. Unfortunately, the same 
properties that enhance the computational 
power of a quantum computer also make it 
sensitive to errors produced by interactions 
with the environment or by imperfect logic 
operations. In this issue, Yao et al.’ (page 489) 
describe the first experimental demonstration 
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of a technique that uses topological effects to 
reduce the sensitivity of a quantum computer 
to errors. 

The bits in a quantum computer, commonly 
referred to as qubits, can be represented by a 
two-state quantum system, such as the two 
quantized energy levels of an atom (Fig. 1a). 
One state represents a logical value of ‘0’ and 
the other state represents a ‘1’ — meaning that, 
like a classical computer, a quantum computer 
is a digital device. But unlike classical phys- 
ics, quantum mechanics allows situations in 
which both possibilities (0 or 1) exist simul- 
taneously. The probability of finding the sys- 
tem in the 0 or 1 state is equal to the square of 
a complex number known as the probability 
amplitude. Asa result, the information stored 
ina qubit corresponds to a continuous range of 
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Figure 1 | Measuring quantum bits. a, Five 
qubits represented by the energy levels of five 
atoms (spheres). The ground state encodes the 
logical value ‘0°, whereas the excited state encodes 
the value ‘Ll’ Because of their quantum nature, 

the atoms can be in both states simultaneously. 
The intensity of the spheres’ colour denotes the 
probability that an energy level is occupied, and 
the colours indicate the phase of the oscillating 
probability amplitude associated with the energy 
level. b, After a measurement, the qubits collapse 
to classical states with a specific value of 0 or 1, and 
the phase information is destroyed, as illustrated 
by the black colour. c, A three-dimensional array 
of qubits can be used to implement topological 
error correction, which reduces the sensitivity of 
quantum computing to errors. The calculation 
consists of a series of measurements that proceeds 
from all of the qubits in the plane that forms the 
left side of the array through the adjacent planes 
to the right. Yao et al.' demonstrated topological 
error correction using an ensemble of eight qubits. 


probability amplitudes. According to quantum 
mechanics, the probability amplitudes of both 
the 0 and 1 states have wave-like properties, 
and their relative position in an oscillatory 
cycle corresponds to an additional degree of 
freedom known as their phase. In addition, 
the qubits can be entangled with one another 
in many different ways. Thus a qubit can 
contain much more information than a 
classical bit, which can have only a specific 
value of 0 or 1. 

Measuring the value of a qubit causes it to 
collapse to a specific value of 0 or 1, reducing 
it to a classical bit (Fig. 1b). Because measur- 
ing a qubit destroys its quantum-mechanical 


properties, it was not initially apparent whether 
there was any way to correct for errors in qubits 
without destroying them. It was subsequently 
shown’ that error correction was possible if a 
‘logical’ qubit was constructed from a combi- 
nation of multiple physical qubits (Fig. 1a). 
For example, the value of the logical qubit 
can be taken to be the parity of the ensemble 
of physical qubits, where parity is defined 
to be 0 if the sum of the qubit values is even 
and 1 if the sum is odd. But there are more 
efficient ways of encoding the logical informa- 
tion. Quantum logic operations on the qubit 
ensemble can be used to correct the errors in 
the individual qubits without measuring the 
value of the logical qubit, and thus without 
destroying the information it encodes. This 
allows the errors in a quantum computer to 
be made arbitrarily small — although addi- 
tional errors will be introduced during the 
error-correction process itself, so the average 
error rate must be below a threshold on the 
order of 10“ for conventional error-correction 
techniques. 

In the type of topological error correction** 
used by Yao et al.’, the logical qubits are distrib- 
uted over a lattice of physical qubits in such a 
way that the information is automatically pro- 
tected against most forms of error. This type 
of error correction is theoretically expected to 
increase the tolerance for errors to above 1%. 
The authors' demonstrated topological error 
correction by combining topological tech- 
niques with cluster-state quantum computing’, 
in which a three-dimensional array, or 
cluster, of qubits is prepared with a carefully 
chosen form of entanglement between nearest 
neighbours in the array. 

Their approach begins with measurement of 
all the qubits in the plane that forms the left side 
of the array (Fig. 1c). The results of those meas- 
urements are then used to decide what kind 
of measurements to perform on the next layer 
of adjacent qubits. No active logic operations 
are performed — instead, the calculation 
depends on choosing the measurements in 
such a way that the collapse of the quantum 
state produces the desired logical operations”. 
The calculation proceeds until the final layer 
of qubits on the far right side of the array — 
the values of which give the desired output 
of the calculation — is reached. The authors 
reduced the sensitivity of the calculations to 
environmental noise and small errors in the 
logical operations by optimizing the spatial 
arrangement, or topology, of the qubits and 
the measurements. 

Yao and colleagues’ performed their experi- 
ment using an eight-qubit cluster, in which the 
value of each qubit (0 or 1) was represented by 
the polarization of a single photon (the direc- 
tion of the photon’s electric field). An optical 
route to quantum computing has the advan- 
tage that optical fibres can be readily used to 
transfer qubits from one location to another. 
The greatest challenge of an optical approach 
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50 Years Ago 


Although the annual figures for 
carriage-rates of all pathogenic 
staphylococci follow no particular 
course, evidence from many sources 
in industrialized countries shows 
that this is not the case with regard 
to the proportions of penicillin- 
resistant organisms ... These 
findings raise many questions about 
the origin and spread of resistant 
strains. They are certainly consistent 
with the general impression of a 
relationship between the increased 
use of penicillin and the growth 

of resistant strains ... Ithas to be 
borne in mind that penicillin, with 
other antibiotics, is being used on a 
large scale for preserving food and 
controlling animal diseases in many 
countries. It is increasingly present 
in milk and cheese, and quite large 
numbers of hospital, veterinary and 
farm workers are intermittently 

or continuously exposed to small 
concentrations of the antibiotic. 
These are all factors likely to 
promote the emergence of resistant 
strains in man. 

From Nature 24 February 1962 


100 Years Ago 


By the death of Lord Lister, the 
world has lost one of its greatest 
men... it was his work which 
gave the main impulse to the 
development of the great science of 
bacteriology, a science which bids 
fair to occupy the most prominent 
place in medical work ... Until 
Pasteur’s time the existence of 
bacteria and their life-history 

had been looked on as only an 
interesting but not very important 
study ... As soon as Lister showed 
that the exclusion of these 
organisms from wounds meant 
the disappearance of a variety of 
diseases to which man had been 
previously subject, the study of 
these organisms naturally advanced 
with great rapidity. 

From Nature 22 February 1912 
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is developing an efficient mechanism for 
generating large numbers of entangled photons 
on demand. The authors were able to enhance 
the efficiency of their entangled-photon source 
using quantum-interference techniques, but 
further improvements in photon sources will 
be necessary. 

Topological error correction could also 
be performed using qubits based on other 
physical systems, such as superconducting 
devices or trapped ions, which, like optical 
approaches, have allowed strong progress to 
be made in quantum computing. Other forms 
of topological error correction® may be able to 
further reduce the sensitivity to experimental 
errors beyond that achieved by the authors. 


STRUCTURAL BIOLOGY 


For example, it is possible to produce a change 
in the phase of a probability amplitude that 
depends only on the number of times that the 
trajectory of a quantum system circles a specific 
point in a complex mathematical space known 
as Hilbert space, regardless of the exact shape of 
the trajectory. Topological error correction can 
increase the tolerance for experimental errors to 
the point that it is consistent with experimental 
capabilities, and greatly increases the prospects 
for building large-scale quantum computers. 
The experiment by Yao et al. represents an 
essential first step in that direction. m 
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Muscarinic receptors 
become crystal clear 


Muscarinic acetylcholine receptors mediate many physiological responses of the 
nervous system. Structures of two of these receptors yield insight into how they 
bind drugs and their mechanism of action. SEE LETTERS P.547 & P.552 


REBECCA L. KOW & NEIL M. NATHANSON 


-protein-coupled receptors (GPCRs) 
‘ex the darling drug targets of many 
pharmaceutical and biotech compa- 

nies. This largest superfamily of cell-mem- 
brane receptors affects many aspects of life, 
including mood and behaviour, the immune 
system and the senses. In this issue, Haga et al. 1 
and Kruse and colleagues” describe the crystal 
structures of two GPCRs — the M2 and M3 
muscarinic acetylcholine receptors, which 
belong to the same GPCR family but couple to 
different effector proteins. The results not only 
advance our understanding of the structure 
and molecular pharmacology of this receptor 
family, but also contribute to our knowledge 
of GPCRs and membrane proteins in general. 
Muscarinic acetylcholine receptors 
(mAChRs) are expressed on most target 
organs of the autonomic branch of the periph- 
eral nervous system, which controls uncon- 
scious physiological responses such as heart 
rate, digestion, respiration and urination. They 
are also expressed in the central nervous sys- 
tem, where they modulate circuits that control 
movement and contribute to processes such 
as learning and memory. Drugs that target 
these receptors are being used and/or tested 
for conditions that include abnormal heart 
rate, asthma, overactive bladder, Alzheimer’s 
disease, Parkinson's disease and schizophrenia. 
Mammals have five subtypes of mAChR 
(M1-M5), which are divided into two 


functional groups: M2 and M4 preferentially 
couple to the G, family of G proteins, whereas 
M1, M3 and M5 couple to the G, family. The 
receptors affect different aspects of body func- 
tion. For instance, M2 decreases heart rate 
by controlling certain potassium-ion mem- 
brane channels, and M3 stimulates hormone 
secretion and relaxes airway smooth muscle. 
Understanding the intricate structural details 
of these receptors should help in the design of 
drugs that target specific mAChRs without 
producing undesirable side effects. 

Solving crystal structures of GPCRs is 
notoriously difficult because of the proteins’ 
natural flexibility. A trademark of these recep- 
tors is their seven transmembrane domains 
(TM1-TM7), which give rise to intracellular 
and extracellular loops. Of these, the third 
intracellular loop is particularly large and 
mobile. To solve the structures of M2 and M3, 
respectively, Haga et al. (page 547) and Kruse 
et al. (page 552) replaced this loop with phage 
T4 lysozyme, a protein that promotes crystal 
formation. As with other GPCRs previously 
crystallized by this approach, the modifica- 
tion did not alter the receptors’ ability to bind 
agonist ligands (molecules that activate the 
receptors, such as the neurotransmitter acetyl- 
choline) or antagonist ligands (molecules that 
block receptor activation). 

Haga et al.' describe the structure of M2 
bound to the muscarinic blocker 3-quinuclidi- 
nyl benzilate. They report that the structure of 
inactive M2 is similar to that of other inactive 
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Figure 1 | Differences between receptor 
subtypes. G-protein-coupled receptors have 
seven transmembrane (TM) domains that span the 
cell membrane, giving rise to three intracellular 
loops (IL1-1L3) and three extracellular loops 
(EL1-EL3). Haga et al.' and Kruse and colleagues” 
report the crystal structures of two such receptors, 
M2 and M3, which are muscarinic acetylcholine 
receptors. The intracellular ends of TM5 and TM6 
are farther apart in M2 (blue) and other G,-coupled 
receptors than in M3 (red) and other G,-coupled 
receptors. This and other structural differences 
between M2 and M3 may contribute to variations 
in the association and dissociation rates of drugs 
targeted to the two receptors. 


GPCRs, particularly in the transmembrane 
domains. But M2 differs most from other 
GPCRs at its extracellular surface and in 
having a 33-angstrém channel that contains the 
ligand binding pocket and extends beyond it. 
The ligand is oriented in the binding pocket by 
an aspartate amino-acid residue in TM3 and an 
asparagine residue in TM6. It also interacts with 
a lid formed by an ‘aromatic cage’ consisting 
of multiple tyrosine and tryptophan residues 
(located in TM3, TM6 and TM7). The authors 
found similar aromatic cages in three non- 
GPCR proteins that bind acetylcholine, which 
suggests that the aromatic cage is a common 
motif for binding this ligand. 

Kruse et al.” determine the structure of M3 
bound to tiotropium — a bronchodilator and 
mAChR blocker. Overall, the structures of 
inactive M3 and M2 are similar. For instance, 
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is developing an efficient mechanism for 
generating large numbers of entangled photons 
on demand. The authors were able to enhance 
the efficiency of their entangled-photon source 
using quantum-interference techniques, but 
further improvements in photon sources will 
be necessary. 

Topological error correction could also 
be performed using qubits based on other 
physical systems, such as superconducting 
devices or trapped ions, which, like optical 
approaches, have allowed strong progress to 
be made in quantum computing. Other forms 
of topological error correction® may be able to 
further reduce the sensitivity to experimental 
errors beyond that achieved by the authors. 


STRUCTURAL BIOLOGY 


For example, it is possible to produce a change 
in the phase of a probability amplitude that 
depends only on the number of times that the 
trajectory of a quantum system circles a specific 
point in a complex mathematical space known 
as Hilbert space, regardless of the exact shape of 
the trajectory. Topological error correction can 
increase the tolerance for experimental errors to 
the point that it is consistent with experimental 
capabilities, and greatly increases the prospects 
for building large-scale quantum computers. 
The experiment by Yao et al. represents an 
essential first step in that direction. m 


James D. Franson is in the Physics 
Department, University of Maryland, 


Muscarinic receptors 
become crystal clear 


Muscarinic acetylcholine receptors mediate many physiological responses of the 
nervous system. Structures of two of these receptors yield insight into how they 
bind drugs and their mechanism of action. SEE LETTERS P.547 & P.552 


REBECCA L. KOW & NEIL M. NATHANSON 


-protein-coupled receptors (GPCRs) 
‘ex the darling drug targets of many 
pharmaceutical and biotech compa- 

nies. This largest superfamily of cell-mem- 
brane receptors affects many aspects of life, 
including mood and behaviour, the immune 
system and the senses. In this issue, Haga et al. 1 
and Kruse and colleagues” describe the crystal 
structures of two GPCRs — the M2 and M3 
muscarinic acetylcholine receptors, which 
belong to the same GPCR family but couple to 
different effector proteins. The results not only 
advance our understanding of the structure 
and molecular pharmacology of this receptor 
family, but also contribute to our knowledge 
of GPCRs and membrane proteins in general. 
Muscarinic acetylcholine receptors 
(mAChRs) are expressed on most target 
organs of the autonomic branch of the periph- 
eral nervous system, which controls uncon- 
scious physiological responses such as heart 
rate, digestion, respiration and urination. They 
are also expressed in the central nervous sys- 
tem, where they modulate circuits that control 
movement and contribute to processes such 
as learning and memory. Drugs that target 
these receptors are being used and/or tested 
for conditions that include abnormal heart 
rate, asthma, overactive bladder, Alzheimer’s 
disease, Parkinson's disease and schizophrenia. 
Mammals have five subtypes of mAChR 
(M1-M5), which are divided into two 


functional groups: M2 and M4 preferentially 
couple to the G, family of G proteins, whereas 
M1, M3 and M5 couple to the G, family. The 
receptors affect different aspects of body func- 
tion. For instance, M2 decreases heart rate 
by controlling certain potassium-ion mem- 
brane channels, and M3 stimulates hormone 
secretion and relaxes airway smooth muscle. 
Understanding the intricate structural details 
of these receptors should help in the design of 
drugs that target specific mAChRs without 
producing undesirable side effects. 

Solving crystal structures of GPCRs is 
notoriously difficult because of the proteins’ 
natural flexibility. A trademark of these recep- 
tors is their seven transmembrane domains 
(TM1-TM7), which give rise to intracellular 
and extracellular loops. Of these, the third 
intracellular loop is particularly large and 
mobile. To solve the structures of M2 and M3, 
respectively, Haga et al. (page 547) and Kruse 
et al. (page 552) replaced this loop with phage 
T4 lysozyme, a protein that promotes crystal 
formation. As with other GPCRs previously 
crystallized by this approach, the modifica- 
tion did not alter the receptors’ ability to bind 
agonist ligands (molecules that activate the 
receptors, such as the neurotransmitter acetyl- 
choline) or antagonist ligands (molecules that 
block receptor activation). 

Haga et al.' describe the structure of M2 
bound to the muscarinic blocker 3-quinuclidi- 
nyl benzilate. They report that the structure of 
inactive M2 is similar to that of other inactive 
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Figure 1 | Differences between receptor 
subtypes. G-protein-coupled receptors have 
seven transmembrane (TM) domains that span the 
cell membrane, giving rise to three intracellular 
loops (IL1-1L3) and three extracellular loops 
(EL1-EL3). Haga et al.' and Kruse and colleagues” 
report the crystal structures of two such receptors, 
M2 and M3, which are muscarinic acetylcholine 
receptors. The intracellular ends of TM5 and TM6 
are farther apart in M2 (blue) and other G,-coupled 
receptors than in M3 (red) and other G,-coupled 
receptors. This and other structural differences 
between M2 and M3 may contribute to variations 
in the association and dissociation rates of drugs 
targeted to the two receptors. 


GPCRs, particularly in the transmembrane 
domains. But M2 differs most from other 
GPCRs at its extracellular surface and in 
having a 33-angstrém channel that contains the 
ligand binding pocket and extends beyond it. 
The ligand is oriented in the binding pocket by 
an aspartate amino-acid residue in TM3 and an 
asparagine residue in TM6. It also interacts with 
a lid formed by an ‘aromatic cage’ consisting 
of multiple tyrosine and tryptophan residues 
(located in TM3, TM6 and TM7). The authors 
found similar aromatic cages in three non- 
GPCR proteins that bind acetylcholine, which 
suggests that the aromatic cage is a common 
motif for binding this ligand. 

Kruse et al.” determine the structure of M3 
bound to tiotropium — a bronchodilator and 
mAChR blocker. Overall, the structures of 
inactive M3 and M2 are similar. For instance, 


a characteristic of mAChRs seems to be an 
outward bend in TM4, which is not seen in 
other GPCRs. But the authors also identify 
a few notable differences in the structures of 
inactive M2 and M3. One is the presence of a 
phenylalanine residue (rather than, as in M2, 
a leucine) in the second extracellular loop of 
M3, which creates a space in the receptor’s 
binding pocket. This small difference in the 
structure of the binding pocket may facilitate 
the development of drugs that have increased 
selectivity for a specific mAChR subtype. The 
relative position of TM7 in the two receptors 
also varies, possibly due to a difference in the 
TM2 amino acids with which TM7 interacts. 
Another difference between M2 and M3 is 
in the position of TMS, especially at the cyto- 
plasmic end of this domain. Specific TM6 resi- 
dues that interact with TMS at the cytoplasmic 
end determine the receptor’s coupling selec- 
tivity for various G proteins. This difference 
may be a factor in the coupling selectivity of 
other GPCRs, as the TM5-—TM6 distance in 
the M2 receptor is longer than that in M3 and 
similar to that in other G,-coupled GPCRs, 


CANCER GENETICS 


whereas in M3 this distance is similar to that 
in other G,-coupled GPCRs (Fig. 1). 

Kruse et al. used molecular-dynamics 
simulations to investigate the binding of 
tiotropium to mAChRs. Although this blocker 
binds to the acetylcholine binding site, the 
simulations indicate that it pauses at a sepa- 
rate (allosteric) site during both association 
and dissociation from the mAChR. Tiotropium 
dissociation from M3 is slower than from M2, 
perhaps because the second extracellular loop 
in M3 is less mobile. Exploiting such differ- 
ences in the extracellular surfaces of mAChRs 
may again contribute to the development of 
subtype-specific drugs, an endeavour that has 
previously been impeded by the close struc- 
tural similarity of the ligand binding regions in 
the transmembrane core of mAChRs. 

These latest advances inevitably raise further 
questions. For example, what are the differ- 
ences in the receptors’ structure on binding to 
antagonists, full agonists and ‘biased’ agonists 
(which elicit only a subset of physiological 
responses’)? Also, for the G-protein-interact- 
ing regions of GPCRs to be sufficiently ordered 


Evolution after 
tumour spread 


A genetic study of brain cancers in mice and humans reveals distinct mutations 
in primary tumours and their metastases, suggesting that the two disease 
‘compartments’ may require different treatments. SEE LETTER P.529 


STEVEN C. CLIFFORD 


he spread of a primary tumour to 

secondary sites in the body is a key step 

in the development of many cancers, 
and treatment of these secondary metastatic 
tumours represents one of the foremost chal- 
lenges in oncology. On page 529 of this issue, 
Wu et al.' describe two new mouse strains that 
serve as models of metastasis in the childhood 
brain cancer medulloblastoma”. In the mice, 
primary and metastatic tumours seem to 
occupy two genetically distinct ‘compartments, 
which arise from divergent DNA-sequence 
mutations that occur after metastasis. The 
authors also detect similar differences in 
human medulloblastoma tumours — a find- 
ing that may influence the development 
of anticancer therapies. 

Wuet al. used an experimental system called 
Sleeping Beauty mutagenesis” to introduce 
random genetic mutations into cerebellar 
progenitor cells in the developing brains of 


“This article and the paper? under discussion were 
published online on 15 February 2012. 


two strains of mice. These two new strains 
were derived by breeding the existing Tp53”" 
and Ptch*’ strains, which are predisposed to 
brain tumours”, with a strain that expresses 
the Sleeping Beauty mutagen in cerebellar 
progenitors. This system leaves a unique 
genetic ‘footprint at each mutation site, which 
allows mutated genes to be identified by DNA 
sequencing. Such mutagenesis experiments are 
used to identify genes in which mutations fre- 
quently arise, because the likelihood of these 
being involved in tumour development is 
reasoned to be above average”. 

As in mouse models of other cancer types”, 
Wu and colleagues’ Sleeping Beauty mutagen- 
esis accelerated the development of medullo- 
blastoma in both mouse strains. The authors 
identified a range of new and established cancer- 
related genes that had mutations, including 
some that have previously been implicated in 
medulloblastoma. They also observed that, 
following mutagenesis, mice of both strains 
developed metastases around a type of brain 
tissue called the leptomeninges, in patterns 
that are reminiscent of metastatic human 
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for crystallization, the receptor must be bound 
to its G protein’. Yet replacement of the third 
intracellular loop with the phage T4 lysozyme 
eliminates receptor coupling to G proteins. The 
conformational changes involved on binding 
to the G protein are therefore unclear. Crystal- 
lization of more intact mAChRs in complex 
with their cognate G proteins is thus required 
for detailed information about the pathways of 
receptor—G-protein coupling. These are some 
of the challenges we face in our attempts to 
better understand mAChRs. = 


Rebecca L. Kow and Neil M. Nathanson 
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medulloblastoma’. The two mouse models thus 
provided an opportunity to track mutations 
present in the primary and metastatic disease, 
and to investigate their genetic provenance. 

Wu et al. found that there were, in general, 
only a few mutations common to primary and 
metastatic tumours from the same mouse, 
but that the mutations in different metasta- 
ses from the same mouse tended to be more 
similar to each other. Moreover, certain muta- 
tions observed in metastases were detected at 
only low levels within the primary tumour, 
and some mutations were unique to one or 
the other tumour type. The authors conclude 
that their findings are consistent with a model 
in which metastases originate from rare cells 
in the primary tumour, and that, following 
metastasis, additional mutations accumulate 
independently — both in the primary tumour 
(post-dispersion events) and in metastases 
(post-metastasis events) (Fig. 1). 

Turning our attention away from mice, 
an obvious question is whether primary and 
metastatic tumours in the human disease also 
show ‘bi-compartmental genetics. Approxi- 
mately 30% of patients with medulloblastoma 
already have metastases when they are first 
diagnosed, and this is associated with a poor 
prognosis’. However, few previous studies 
have compared the biology of human primary 
tumours with their associated metastases, 
mainly because metastases are not routinely 
biopsied. Despite the limited sample availabil- 
ity, Wu et al.’ show initial evidence of differing 
genetics in primary and metastatic tumours 
from seven human patients. 

Further investigation is required to estab- 
lish whether the authors’ findings are broadly 
relevant to human medulloblastoma. The 
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Figure 1 | A bi-compartmental genetic model of cancer metastasis. By analysing the tumours from 
two strains of mice that model the brain cancer medulloblastoma, Wu et al.' found differences in the 
DNA-sequence mutations present in primary and metastatic tumours. They propose that rare cells in the 
primary tumour that are capable of metastasizing disperse to other sites in the brain, where they form 
metastases. The cells of the primary tumour and the metastases then continue to accumulate mutations, 


generating two distinct genetic compartments. 


human disease exhibits’ more complex 
patterns of metastases than are observed in 
mice, and is classified into four molecular sub- 
groups (WNT, SHH, Group 3 and Group 4), 
which each display distinct biological and 
clinical characteristics’. The Ptch*’” mice used 
by Wuet al.’ develop SHH-associated medullo- 
blastomas’; similar mutagenesis-driven 
approaches using existing mouse models 
of other medulloblastoma disease groups, such 
as WNT’, might prove informative. 

Perhaps the most urgent question arising 
from this study' is whether the genetic differ- 
ences between the two disease compartments 
lead to distinct biological features that make 
them respond differently to treatment. In 
mice, these compartments remain genomically 
characterized entities, the biological and ther- 
apeutic importance of which is untested. In 
humans, clinical-trial data show” that primary 
and metastatic sites respond similarly to cur- 
rent therapies (with cure achieved at both sites) 
in around 60% of children with metastatic dis- 
ease, but a more objective assessment of treat- 
ment response is confounded by the fact that 
primary tumours are mostly removed by sur- 
gery prior to treatment. Wu et al. provide initial 
evidence that the tumour compartments may 
respond differently to current therapies in cer- 
tain patients, but they rightly caution that these 
effects could also relate to clinical factors such 
as radiotherapy being delivered at different 
intensities to different tumour sites. 

Some of the mutations identified by Wu 
and colleagues’ experiments may also reveal 
biological processes or pathways that could 
offer drug targets for the improved treat- 
ment of primary tumours, metastases, or 
both. The new mouse strains provide excel- 
lent models in which to test this possibil- 
ity. The multitude and variety of mutations 
described by Wu et al.' are noteworthy, but 
the next challenge is to determine which of 


them can drive tumour development, which 
are therapeutically relevant, and which occur 
at sufficient frequency in the human disease 
to warrant their pursuit as potential targets. 
The authors justifiably reason that targets 
that are common to primary tumours and 
metastases, in both humans and mice, are 
those most attractive for further develop- 
ment. However, only one cellular pathway, 
insulin-dependent signalling, meets these 


CLIMATE CHANGE 


criteria on the basis of their current data. 

Providing answers to all these questions 
will require further biological investigation 
across species, as well as clinical studies. An 
additional challenge is posed by the fact that 
there are fewer than 700 cases of medullo- 
blastoma per year in Europe. More routine 
biopsy and characterization of human meta- 
stases will be essential, and the impetus and 
ethical justification for such a fundamen- 
tal change to clinical practice will, at least 
in part, come from experimental studies 
such as those presented here. Time will tell 
whether this tale of Sleeping Beauty and mice 
develops into a clinically relevant human 
paradigm. = 
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Shrinking glaciers 
under scrutiny 


Melting glaciers contribute to sea-level rise, but measuring their mass loss over 
time is difficult. An analysis of satellite data on Earth’s changing gravity field 
does just that, and delivers some unexpected results. SEE LETTER P.514 


JONATHAN BAMBER 


laciers and ice caps are pivotal features 

of both water resources and tourism. 

They are also a significant contribu- 
tor to sea-level rise. About 1.4 billion people 
are dependent on the rivers that flow from the 
Tibetan plateau and Himalayas’. Yet significant 
controversy’ and uncertainty surround the 
recent past and future behaviour of glaciers in 
this region. This is not so surprising when one 
considers the problem in hand. There are more 
than 160,000 glaciers and ice caps worldwide. 
Fewer than 120 (0.075%) have had their mass 
balance (the sum of the annual mass gains and 
losses of the glacier or ice cap) directly meas- 
ured, and for only 37 of these are there records 
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extending beyond 30 years. Extrapolating this 
tiny sample of observations to all glaciers and 
ice caps is a challenging task that inevitably 
leads to large uncertainties. 

On page 514 of this issue, Jacob and col- 
leagues’ describe a study based on satellite data 
for Earth’s changing gravity field that tackles 
this problem*. Their results have surprising 
implications for both the global contribu- 
tion of glaciers to sea level and the changes 
occurring in the mountain regions of Asia. 

Melting glaciers are an iconic symbol of 
climate change. On the basis of the limited 
data mentioned above, they seem to have 
been receding, largely uninterrupted, almost 


“This article and the paper® under discussion were 
published online on 8 February 2012. 


everywhere around the world for several dec- 
ades*. Scaling up the small sample of ground- 
based observations to produce global estimates 
is, however, fraught with difficulty. Size, local 
topography, altitude range, aspect and micro- 
climate all affect the response of individual 
glaciers in complex ways. Even the seasonality 
of changes in temperature and precipitation 
strongly influence the glaciers’ response, and 
those that terminate in a lake or ocean behave 
differently again. 

Nonetheless, until recently there was little 
alternative to some form of extrapolation of 
the terrestrial observations to large regions 
and numbers of glaciers. One such high- 
profile assessment’ concluded that, during the 
period 1996-2006, the mass loss from glaciers 
and ice caps (GICs) increased steadily, contrib- 
uting a sea-level rise of 1.1+0.24 millimetres 
per year by 2006. In this study’, the authors 
concluded that GICs had been the domi- 
nant mass contributor to sea-level rise over 
the study period, and they extrapolated their 
results forward to argue that this would also be 
the case in the future. 

Then along came the Gravity Recovery 
and Climate Experiment (GRACE), which 
consists of a pair of satellites that have been 
making global observations of changes 
in Earth’s gravity field since their launch 
in 2002. They have been used in various 
studies to examine the changing mass of the 
great ice sheets of Antarctica and Greenland® 
and several other large glaciated regions’. 
But, so far, the data have not been analysed 
simultaneously and consistently for all areas. 

The difficulty with doing this is that GRACE 
measures the gravity field of the complete 
Earth system. This includes mass exchange 
and/or mass redistribution in the oceans, 
atmosphere, solid Earth and land hydrology, 
in addition to any changes in GIC volume. To 
determine the latter, it is clearly essential to be 
able to separate it from the other sources of 
mass movement that affect the gravity field. A 
second, related issue is the effective resolution 
of the observations. The GRACE satellites are 
sensitive to changes in the gravity field over 
distances of a few hundred kilometres. They 
cannot ‘see’ the difference between the signal 
from one glacier or small ice cap and another. 

To isolate the GIC signal from others at the 
surface, Jacob and colleagues defined units of 
mass change — called mass concentrations, 
or mascons — within each of their 18 GIC 
regions (including the European Alps; Fig. 1). 
Each region might have many tens of mascons 
defining the geographic extent of significant 
ice volume within the sector’. Combined with 
global models of land hydrology and atmos- 
pheric-moisture content, the authors were able 
to isolate the GIC mass trends over the eight- 
year (2003-10) period of the observations. 
What they found was unexpected. 

First, the contribution of GICs (excluding 
the Antarctica and Greenland peripheral 
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Figure 1 | The Leschaux and Taléfre glaciers in the French Alps. The photograph highlights 

the complex and intricate topographic setting of these mountain glaciers and the difficulty in 
extrapolating observations from one glacier to others. Jacob and colleagues’ avoided these difficulties 
by using the area-integrated signal from satellite gravity data. 


GICs) to sea-level rise was less than half the 
value of the most recent, comprehensive esti- 
mate® obtained from extrapolation of in situ 
measurements for 2001-05 (0.41 + 0.08 
compared with 1.1 mm yr’). Second, losses 
for the High Mountain Asia region — com- 
prising the Himalayas, Karakoram, Tianshan, 
Pamirs and Tibet — were insignificant. Here, 
the mass-loss rate was just 4+ 20 gigatonnes 
per year (corresponding to 0.01 mmyr “ of sea- 
level rise), compared with previous estimates 
that were well over ten times larger. By a care- 
ful analysis, the authors discounted a possible 
tectonic origin for the huge discrepancy, and 
it seems that this region is more stable than 
previously believed. 

What is the significance of these results”? 
Understanding, and closing, the sea-level 
budget (the relative contributions of mass 
and thermal expansion to ocean-volume 
change) is crucial for testing predictions of 
future sea-level rise. Estimates of the future 
response of GICs to climate change are, in 
general, based on what we know about how 
they have responded in the past. A better esti- 
mate of past behaviour, such as that obtained 
by Jacob and colleagues, will therefore result 
in better estimates of future behaviour. 
Discussion of the demise of the Himalayan 
glaciers has been mired in controversy, partly 
because of basic errors’, but also because 
of the dearth of reliable data on past trends. 
Given their role as a water supply for so many 
people’, this has been a cause for concern and an 
outstanding issue. 

Of course, eight years is a relatively short 


observation period. Some of the regions, 
such as the Gulf of Alaska, experience large 
inter-annual variations in mass balance that 
are mainly due to variability in precipitation’. 
This is also true for the High Mountain Asia 
region’, and, as a consequence, a different 
measurement period could significantly alter 
the estimated trend for this sector. Further- 
more, some areas, such as the European Alps 
and Scandinavia, have been relatively well 
monitored, and thus constrained, using other 
approaches. Nonetheless, Jacob and colleagues 
have dramatically altered our understanding of 
recent global GIC volume changes and their 
contribution to sea-level rise. Now we need to 
work out what this means for estimating their 
future response. m 
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The case for open computer programs 


Darrel C. Ince!, Leslie Hatton? & John Graham-Cumming? 


Scientific communication relies on evidence that cannot be entirely included in publications, but the rise of 
computational science has added a new layer of inaccessibility. Although it is now accepted that data should be made 
available on request, the current regulations regarding the availability of software are inconsistent. We argue that, with 
some exceptions, anything less than the release of source programs is intolerable for results that depend on computation. 
The vagaries of hardware, software and natural language will always ensure that exact reproducibility remains 
uncertain, but withholding code increases the chances that efforts to reproduce results will fail. 


opportunities for scientific advance. Ever more powerful computers 

enable theories to be investigated that were thought almost 
intractable a decade ago, robust hardware technologies allow data collec- 
tion in the most inhospitable environments, more data are collected, and 
an increasingly rich set of software tools are now available with which to 
analyse computer-generated data. 

However, there is the difficulty of reproducibility, by which we mean 
the reproduction of a scientific paper’s central finding, rather than exact 
replication of each specific numerical result down to several decimal 
places. We examine the problem of reproducibility (for an early attempt 
at solving it, see ref. 1) in the context of openly available computer 
programs, or code. Our view is that we have reached the point that, with 
some exceptions, anything less than release of actual source code is an 
indefensible approach for any scientific results that depend on computa- 
tion, because not releasing such code raises needless, and needlessly 
confusing, roadblocks to reproducibility. 

At present, debate rages on the need to release computer programs 
associated with scientific experiments* *, with policies still ranging from 
mandatory total release to the release only of natural language descrip- 
tions, that is, written descriptions of computer program algorithms. 
Some journals have already changed their policies on computer program 
openness; Science, for example, now includes code in the list of items 
that should be supplied by an author’. Other journals promoting code 
availability include Geoscientific Model Development, which is devoted, 
at least in part, to model description and code publication, and 
Biostatistics, which has appointed an editor to assess the reproducibility 
of the software and data associated with an article’. 

In contrast, less stringent policies are exemplified by statements such 
as’ “Nature does not require authors to make code available, but we do 
expect a description detailed enough to allow others to write their own 
code to do similar analysis.” Although Nature’s broader policy states that 
“,.authors are required to make materials, data and associated protocols 
promptly available to readers...”, and editors and referees are fully 
empowered to demand and evaluate any specific code, we believe that 
its stated policy on code availability actively hinders reproducibility. 

Much of the debate about code transparency involves the philosophy of 
science, error validation and research ethics*”, but our contention is more 
practical: that the cause of reproducibility is best furthered by focusing on 
the dissection and understanding of code, a sentiment already appreciated 
by the growing open-source movement". Dissection and understanding 
of open code would improve the chances of both direct and indirect 
reproducibility. Direct reproducibility refers to the recompilation and 


T he rise of computational science has led to unprecedented 


rerunning of the code on, say, a different combination of hardware and 
systems software, to detect the sort of numerical computation'’’? and 
interpretation’* problems found in programming languages, which we 
discuss later. Without code, direct reproducibility is impossible. Indirect 
reproducibility refers to independent efforts to validate something other 
than the entire code package, for example a subset of equations or a par- 
ticular code module. Here, before time-consuming reprogramming of an 
entire model, researchers may simply want to check that incorrect coding of 
previously published equations has not invalidated a paper’s result, to 
extract and check detailed assumptions, or to run their own code against 
the original to check for statistical validity and explain any discrepancies. 

Any debate over the difficulties of reproducibility (which, as we will 
show, are non-trivial) must of course be tempered by recognizing the 
undeniable benefits afforded by the explosion of internet facilities and the 
rapid increase in raw computational speed and data-handling capability 
that has occurred as a result of major advances in computer technology". 
Such advances have presented science with a great opportunity to address 
problems that would have been intractable in even the recent past. It is 
our view, however, that the debate over code release should be resolved as 
soon as possible to benefit fully from our novel technical capabilities. On 
their own, finer computational grids, longer and more complex compu- 
tations and larger data sets—although highly attractive to scientific 
researchers—do not resolve underlying computational uncertainties of 
proven intransigence and may even exacerbate them. 

Although our arguments are focused on the implications of Nature’s 
code statement, it is symptomatic of a wider problem: the scientific 
community places more faith in computation than is justified. As we 
outline below and in two case studies (Boxes 1 and 2), ambiguity in its 
many forms and numerical errors render natural language descriptions 
insufficient and, in many cases, unintentionally misleading. 


The failure of code descriptions 
The curse of ambiguity 
Ambiguity in program descriptions leads to the possibility, if not the 
certainty, that a given natural language description can be converted 
into computer code in various ways, each of which may lead to different 
numerical outcomes. Innumerable potential issues exist, but might 
include mistaken order of operations, reference to different model ver- 
sions, or unclear calculations of uncertainties. The problem of ambiguity 
has haunted software development from its earliest days. 

Ambiguity can occur at the lexical, syntactic or semantic level’ and is 
not necessarily the result of incompetence or bad practice. It is a natural 
consequence of using natural language’® and is unavoidable. The 
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BOX | 


The United Kingdom Meteorological Office produces (in conjunction 
with the University of East Anglia’s Climatic Research Unit) the 
downloadable and widely used gridded temperature anomaly data 
sets known as HadCRUT and CRUTEMS. Yet even such a high-profile 
data set, developed by an organization with a good standard of 
software development*’, contained errors that would have been more 
quickly identified and rectified had the underlying code been readily 
available. 

In 2009, on examining the available data sets and the description of 
the algorithm®®, J.G.-C. identified a number of errors (the software he 
used to check the meteorological database is available upon request). 
One set of errors was procedural, and involved incorrect computation 
of historical average temperatures in a number of records in New 
Zealand and Australia. The Meteorological Office confirmed the errors, 
showed that they had resulted in errors up to 0.2 °C (either warmer or 
cooler) in the average temperature for Australia and New Zealand in 
some years before 1900, and issued an update to CRUTEM3. Two 
other errors occurred in the coding of the calculation of station errors 
(an estimate of the error in any average temperature reading). When 
corrected, a minor reduction in station errors resulted, improving the 
accuracy of the data. So, although these implementation problems did 
not lead to serious errors in the temperature data sets, they highlight 
the difficulty of translating a natural-language description (even with 
some formulae expressed mathematically) into code. 

These errors do not in any way reflect badly on the original authors. 
The code rewriting simply plays the part of peer review and it is normal 
to find such errors. Indeed, the discovery of such errors in ‘working’ 
software is exceedingly common in all computing, even when the 
software has been in use for a considerable time. This was 
emphatically demonstrated in a seminal IBM study?°, demonstrating 
that fully a third of all the software failures in the study took longer than 
5,000 execution years (execution time indicates the total time taken 
executing a program) to fail for the first time. 


problem is regarded as so axiomatic that its avoidance or minimization 
is routinely taught at the undergraduate level in computing degrees. Nor 
is the study of ambiguity confined to the classroom. Active research 
continues on the use of tools for the detection of ambiguity’, the avoid- 
ance of ambiguity in major projects'®, and the clarification of the intended 
functions of computer programs”. 

One proposed solution to the problem of ambiguity is to devote a 
large amount of attention to the description of a computer program, 
perhaps expressing it mathematically or in natural language augmented 
by mathematics. But this expectation would require researchers to 
acquire skills that are only peripheral to their work (set theory, predicate 
calculus and proof methods). Perhaps worse, investment of effort or 
resources alone cannot guarantee the absence of defect'’. A recent 
study” of a tightly specified, short, simply expressed algorithm whose 
semi-mathematical specification was supplemented by example outputs 
showed that major problems still arose with large numbers of programs 
individually implemented to this specification. In short, natural language 
descriptions cannot hope to avoid ambiguous program implementations, 
with unpredictable effects on results. 


Errors exist within ‘perfect’ descriptions 

Let us assume for a moment that a researcher, perhaps trained—as are 
computer scientists—to think of computer algorithms as mathematical 
objects, and fully versed in the formal semantics of software description, 
has managed to describe a computer program perfectly in some notation. 
Unfortunately, even such a description would not ensure direct or indirect 
reproducibility, because other forms of error or ambiguity (unrelated to 
natural language) are likely to creep in, leading to potentially serious 
uncertainties (Box 2). 
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BOX 2 


As discussed, unambiguous descriptions are no guarantee of 
reproducibility. One example from the geological literature makes the 
point?’. This study compared nine different commercial 
implementations of the same seismic data-processing algorithms, 
developed independently. Several sources of ambiguity were 
successfully excluded, the same data set was used, the signal- 
processing algorithms used were unambiguously specified in 
mathematics, and the same programming language was used 
(Fortran 77). The individual companies followed industry standards in 
code implementation. 

Approximately 200,000 lines of code were exercised in each of the 
packages in a 14-stage pipeline for which the output of each stage was 
the input to the next. The signal-processing algorithms used would be 
familiar to many scientists—such as Wiener deconvolution, acoustic 
wave equation solutions, fast Fourier transforms and numerous 
common statistical procedures. 

The initial stage involved reading 32-bit pressure data from tapes 
recorded in a marine environment. During the processing pipeline, the 
agreement between the results of each package declined from the six 
significant figures present in the input data to only between one and 
two in the final output. These data, however, were used by geologists to 
site extremely expensive marine drilling rigs and could 
“fundamentally affect the conclusions reached as to the nature of 
potential hydrocarbon accumulations” 3”. Furthermore “it seems 
reasonable to infer that the primary source of disagreement is indeed 
software error’’?”. Even porting other seismic software between 
different architectures using the same input data lost two out of six 
significant places!*. On the positive side, correction of the 
programming errors found during developer feedback led to 
considerably improved agreement. 

Although conducted some years ago, the study is just as relevant 
today. Fortran 77 is still in use in one dialect or another in scientific 
research, the same software assurance procedures are still widely 
used, and scientific programmers are still people, subject to human 
fallibility. 


First, there are programming errors. Over the years, researchers have 
quantified the occurrence rate of such defects to be approximately one to 
ten errors per thousand lines of source code”’. 

Second, there are errors associated with the numerical properties of 
scientific software. The execution of a program that manipulates the 
floating point numbers used by scientists is dependent on many factors 
outside the consideration of a program as a mathematical object”. 
Rounding errors can occur when numerous computations are repeatedly 
executed, as in weather forecasting”’. Although there is considerable 
research in this area, for example in arithmetic and floating point calcula- 
tions**’’, algorithms”, verification” and fundamental practice*’, much 
of it is published in outlets not routinely accessed by scientists in generic 
journals, such as Computers ¢& Mathematics with Applications, 
Mathematics in Computer Science and the SIAM Journal on Scientific 
Computing. 

Third, there are well-known ambiguities in some of the internationally 
standardized versions of commonly used programming languages in 
scientific computation’’. Monniaux” describes an alarming example 
relating to implementation of software features: 


“More subtly, on some platforms, the exact same expression, with the 
same values in the same variables, and the same compiler, can be evaluated 
to different results, depending on seemingly irrelevant statements (print- 
ing debugging information or other constructs that do not openly change 
the values of variables).” 


This is known as an order-of-evaluation problem and many program- 
ming languages are subject to its wilful ways. Ironically, such execution 
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ambiguity is quite deliberate and is present to allow a programming 
language compiler more flexibility in its optimization strategy. And 
even when programs are simple, or developed by the largest software 
companies, such errors remain surprisingly common: numerical 
ambiguity led Microsoft to declare in 2010 and reaffirm in September 
2011, that the treatment of floating point numbers in its popular Excel 
spreadsheet “...may affect the results of some numbers or formulas due 
to rounding and/or data truncation.” (http://support.microsoft.com/kb/ 
78113). 


Perfection is no guarantee of reproducibility 

Finally, even if a computer program could be unambiguously described 
and implemented without error, other problems can arise in machine 
deployment whereby the results from identical code often diverge when 
hardware and software configurations are changed”. So even perfection 
in one’s own software environment does not guarantee reproducibility. 
As a result, to maximize the chances of reproducibility and consistency, 
not only would we urge code release, but also a description of the 
hardware and software environment in which the program was executed 
and developed. 


Challenges are no excuse for closed code 

Nature’s policy on code release implies that algorithmic descriptions using 
mathematical specifications, equations, formal algorithmic descriptions 
or pseudocode (simplified version of complete code) may be required. But 
there is no guarantee that such tools can avoid ambiguity”, and even if 
they could, we have shown above that implementation and numerical 
errors—possibly compounded by differences in machine architecture— 
will still arise. So, even if complete code is made available, exact replication 
or even reproduction of central results may fail. A reasonable observer 
might therefore ask why code should be made available at all. Our res- 
ponse is that the alternative is far worse. Keeping code closed ensures that 
potential uncertainties or errors in a paper’s conclusions cannot be traced 
to ambiguity, numerical implementation, or machine architecture issues 
and prevents testing of indirect reproducibility. Although it is true that 
independent efforts to reproduce computational results without recourse 
to the original source code constitute an important approach, the all-too- 
common treatment of code as a black box unnecessarily slows and 
impedes valid efforts to evaluate model results. We therefore regard the 
non-availability of code as a serious impediment to reproducibility. 


Potential barriers and proposed solutions 


There are a number of barriers to the release of code. These include a 
shortage of tools that package up code and data in research articles; a 
shortage of central scientific repositories or indexes for program code; 
an understandable lack of perception of the computational problems 
with scientific code leading to the faulty assumption that program 
descriptions are adequate (something we address in this article); and 
finally that the development of program code is a subsidiary activity in 
the scientific effort. 


A modest proposal 

An effective step forward would be for journals to adopt a standard for 
declaring the degree of source code accessibility associated with a sci- 
entific paper. A number of simple categories illustrate the idea: 


e Full source code: full release of all source code used to produce the 
published results along with self-tests to build confidence in the 
quality of the delivered code, as is the case with Perl modules in 
the CPAN archive, for example (http://cpan.org). 

e Partial source code: full release of source code written by the 
researcher accompanied by associated documentation of ancillary 
packages used, for example commercial scientific subroutine libraries. 

e Marginal source code: release of executable code and an application 
programming interface to allow other researchers to write test cases. 

e No source code: no code at all provided. 
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This hierarchy of disclosure would alert both the readers and authors of 
a journal article to the fact that the issue is important and would high- 
light the degree to which results might be reproduced independently. 
There remain, however, some potential stumbling blocks, a number of 
which can easily be resolved using existing facilities. 


Intellectual property rights 

Clearly, if there is evidence of commercial potential or use, such as a 
patent or some copyright, then there is a problem. It is difficult to see 
how a journal might deal with this without substantial financial com- 
mitment to independent testing under a non-disclosure agreement or 
possibly even the purchase of commercial rights. Perhaps the simplest 
solution is for a journal to flag the software as ‘No source code’ (ideally 
giving the reasons) until such time as the source code can be included, 
either because the code goes into the public domain or is released under 
some free licence. Such a designation simply says that, for the moment, 
the results are not reproducible with the authors’ own source code, and 
that testing of the main results must proceed with independent 
approaches. 


Limited access 

Researchers may not have access to at least some of the software packages 
that are used for development. We suggest that this would not be a 
problem for most researchers: their institutions would normally provide 
such software. If it were to be a problem, then a journal could mark a 
publication as ‘Partial source code’. The release of the code, even without 
the software environment required for compilation and execution, would 
still be valuable in that it would address issues such as dissection and 
indirect reproducibility (see above) and would enable rewriting using 
other programming languages. 


Procedure 

Adopting the simple disclosure of the availability of source code will 
help make it clear to the readership of a journal that this is an important 
issue, while also giving them an idea of the degree of code release. 
However, we would further suggest that journals adopt a standard that 
specifies that supplementary material supporting a research article must 
describe each of the released modular components of any software used. 
Nature editors and referees are already empowered to include an 
appraisal of code in their judgement about the publication potential of 
the article, and this practice should be more widely advertised and 
supported. A good example of this approach is the way that the journal 
Geoscientific Model Development asks authors to describe their program 
code. 


Logistics 

Over the past two decades, the open-source community has solved the 
logistics of releasing and storing code while maintaining a cooperative 
development environment. SourceForge (http://www.sourceforge.net/) 
is an excellent example. Founded in 1999, it is a web-based source-code 
repository which acts as a free centralized location for developers 
working on open-source projects. It currently hosts around 300,000 
projects and has over two million registered users. Not only does it store 
source code but also it provides access to version control information, 
project wikis (websites that are easily modifiable by its users) and data- 
base access. We urge funding agencies to investigate and adopt similar 
solutions. 


Packaging 

There are a number of tools that enable code, data and the text of the article 
that depends on them to be packaged up. Two examples here are Sweave 
associated with the programming language R and the text-processing 
systems LaTeX and LyX, and GenePattern-Word RRS, a system specific 
to genomic research*'. Sweave allows text documents, figures, experi- 
mental data and computer programs to be combined in such a way that, 
for example, a change in a data file will result in the regeneration of all the 
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research outputs. GenePattern-Word RRS is similar in that it enables an 
author to link text, tables and figures to the analysis and data that yielded 
the results, reported in a word-processed document; it also allows further 
experimentation (for example, additional analyses can be carried out). It 
is still early days, however, and localized solutions are emerging at the 
grassroots level. Donoho and co-workers, for example, have developed 
software packages that allow anyone with access to the Matlab program- 
ming language and development environment to reproduce figures from 
their harmonic analysis articles, inspect source code, change parameters 
and access data sets*’. 


Steps to implementation 

Our thesis is that journal and funding body strictures relating to code 
implementations of scientific ideas are now largely obsolete. We have 
suggested one modest path to code availability in this article. There are a 
number of further steps that journals, academies and educational orga- 
nizations might consider taking: 


e Research funding bodies should commission research and develop- 
ment on tools that enable code to be integrated with other elements 
of scientific research such as data, graphical displays and the text of 
an article. 

Research funding bodies should provide metadata repositories that 
describe both programs and data produced by researchers. The 
Australian National Data Service (http://www.ands.org.au/) which 
acts as an index to data held by Australian research organizations, is 
one example of this approach. 

Journals should expect researchers to provide some modular 
description of the components of the software that support a 
research result; referees should take advantage of their right to 
appraise software as part of their reviewing task. An example of a 
modular description can be seen in a recent article published in 
Geoscientific Model Development”. 

Science departments should expand their educational activities into 
reproducibility. Clearly such teaching should be relevant to the 
science at hand; however, courses on statistics, programming and 
experimental method could be easily expanded and combined to 
include the concept of reproducibility. 
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Experimental demonstration of 
topological error correction 


Xing-Can Yao', Tian-Xiong Wang!, Hao-Ze Chen!, Wei-Bo Gaol, Austin G. Fowler’, Robert Raussendorf*, Zeng-Bing Chen!, 
Nai-Le Liu', Chao- Yang Lu’, You-Jin Deng’, Yu-Ao Chen! & Jian-Wei Pan! 


Scalable quantum computing can be achieved only if quantum bits are manipulated in a fault-tolerant fashion. 
Topological error correction—a method that combines topological quantum computation with quantum error 
correction—has the highest known tolerable error rate for a local architecture. The technique makes use of cluster 
states with topological properties and requires only nearest-neighbour interactions. Here we report the experimental 
demonstration of topological error correction with an eight-photon cluster state. We show that a correlation can be 
protected against a single error on any quantum bit. Also, when all quantum bits are simultaneously subjected to errors 
with equal probability, the effective error rate can be significantly reduced. Our work demonstrates the viability of 
topological error correction for fault-tolerant quantum information processing. 


Quantum computers exploit the laws of quantum mechanics and can 
solve many problems exponentially more efficiently than their classical 
counterparts’. However, in the laboratory the ubiquitous decoher- 
ence of quantum states makes it notoriously hard to achieve the 
required high degree of quantum control. To overcome this problem, 
quantum error correction has been invented**. The principal result in 
quantum error correction, the threshold theorem’®, states that as long 
as the error rate, p, per gate in a quantum computer is smaller than a 
threshold value, p,, arbitrarily long and accurate quantum computa- 
tion is efficiently possible. However, most methods of fault-tolerant 
quantum computing with a high threshold error rate (10 *-10 7) 
require strong and long-range interactions’°, and are thus difficult 
to implement. Local architectures are normally associated with much 
lower thresholds. For traditional concatenated codes on a two-dimen- 
sional lattice of quantum bits (qubits) with nearest-neighbour gates, 
the highest threshold known at present"? is 2.02 X 10°. 

In such lattices, it is advantageous to use topological error correc- 
tion''" (TEC) in the framework of topological cluster-state quantum 
computing. This scheme makes use of the topological properties in 
three-dimensional (3D) cluster states, which form an inherently 
error-robust ‘fabric’ for computation. Local measurements drive the 
computation and, at the same time, implement the error correction. 
Active error correction and topological methods are combined, yield- 
ing a high error threshold’*”* of 0.7-1.1% and tolerating loss rates'* up 
to 24.9%. This allows for the unavoidable imperfections of physical 
devices, and makes our implementation of TEC close to the experi- 
mental state of the art. For practical quantum computation with TEC, 
a larger cluster state of more qubits would be needed. The 3D archi- 
tecture can be further mapped onto a local setting in two spatial 
dimensions plus time’, also with nearest-neighbour interactions 
only. Two detailed architectures have already been proposed’®””. 
We note that a different topological scheme has been proposed in 
which quantum computation is driven by non-Abelian anyons'*”” 
and fault tolerance is achieved through passive stabilization afforded 
by a ground-state energy gap. 

Some simple quantum error correction codes have been experi- 
mentally demonstrated in nuclear magnetic resonance”, ion 


traps**”* and optical systems***°. However, the experimental realiza- 


tion of topological quantum error correction methods remains 
challenging. At present, multipartite cluster states can be generated 
with up to six photons and work is under way to create non-Abelian 
anyons for topological quantum computing’*’’. Here we develop an 
ultrabright entangled-photon source by using an interferometric Bell- 
type synthesizer. With this and a noise-reduction interferometer, 
we generate a polarization-encoded eight-photon cluster state, 
which is shown to possess the required topological properties for 
TEC. In accordance with the TEC scheme, we measure each photon 
(qubit) locally. Error syndromes are constructed from the measure- 
ment outcomes, and one topological quantum correlation is pro- 
tected. We demonstrate that if only one physical qubit suffers an 
error, the faulty qubit can be located and corrected, and that if all 
qubits are simultaneously subjected to errors with equal probability, 
the effective error rate is significantly reduced by error correction. 
This constitutes a proof-of-principle experiment that demonstrates 
the viability of TEC, a central ingredient in topological cluster-state 
computing. 


Cluster states and quantum computing 

In cluster-state quantum computing”®, projective one-qubit measure- 
ments replace unitary evolution as the elementary process driving a 
quantum computation. The computation begins with a highly 
entangled multi-qubit state, the “cluster state’ G) (ref. 27), which is 
specified by an interaction graph, G, and can be created from a product 
state through the pairwise Ising interaction over the edges in G. For 
each vertex i © G, we define a stabilizer as Kj=X;®,,Z; where the 
product is over all the interaction edges, ej Connecting vertex i to its 
nearest-neighbouring vertices, j. The symbols X; and Z; denote the 
bit- and phase-flip Pauli operators, respectively, acting on qubits i 
and j. State |G) is the unique joint eigenstate of a complete set of 
stabilizers K; such that K;,|G) = |G) for alli € G. 

Cluster states in d = 3 dimensions are resources for universal fault- 
tolerant quantum computing”, in which the TEC capability—shared 
with Kitaev’s toric code'’”* and the colour code””—is combined with 
the capability to process quantum information. 
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Topological error correction 

Quantum error correction and fault-tolerant quantum computing are 
possible with cluster states whenever the underlying interaction graph 
can be embedded in a 3D cell structure known as a cell complex”, 
which consists of volumes, faces, edges and vertices. Qubits are 
encoded on the edges and faces of a cell complex. The associated 
interaction graph connects the qubit on each face to the qubits on 
its surrounding edges through the interaction edges. Consider the 
elementary cell complex in Fig. 1a, shown by the dashed lines: it has 
one cubic volume, six square faces, twelve edges and eight vertices. 
The interaction edges, represented by the solid lines, form an 18-qubit 
cluster state, |G s). There are six face stabilizers, Ky (f= 1, 2, ..., 6). It 
follows that multiplication of these stabilizers cancels out all Z operators 
in Ky and thus yields a unit expectation value: (XX --- X¢) = 1. This 
leads to the straightforward but important observation that despite the 
X measurement on each individual face qubit having the random 
outcome +1, the product of all the outcomes on any closed surface, 
F, is +1. That is, any closed surface has the topological quantum 
correlation Cp=(@ ye pX)) = 1, where fis a face of F. 

A larger cell complex is displayed in Fig. 1b, which encodes and 
propagates a logical qubit. It consists of 5 <5 X T cells, where T 
specifies a span of simulated time (t). A ‘defect’ along the f direction 
(Fig. 1b, line of green dots) is first produced by performing local Z 
measurements. Then the topological quantum correlation, Cp, = 1, 
ona defect-enclosing closed surface (Fp), combined with the boundary, 
is used to encode a logical qubit. The evolution of the logical state from 
ty to ty is achieved by local X measurements on all other physical qubits 
between t, and t, (see ref. 31 for details). Quantum computing requires a 
much larger cell complex and more defects, where quantum algorithms 
are realized by appropriate braiding-like manipulation of defects (a 
sketch of the logical controlled-NOT gate is shown in Supplementary 
Information). 

The quantum computation is possible because the topological 
quantum correlation Cr, = 1 holds on defect-enclosing closed surfaces. 
The TEC capability arises from the Z, homology, a topological feature, 
of a sufficiently large 3D cell complex (Supplementary Information). 
For a given Fp, there exist many homologically equivalent closed surfaces 
with the same topological correlation (Cg, = 1). This redundancy leads 
to the topological protection of the correlation”. 

Remarkably, in TEC it is sufficient to deal with Z errors, because an 
X error either has no effect, if it occurs immediately before an X 
measurement, or is equivalent to multiple Z errors. Finally, as TEC 
is implemented in topological cluster-state quantum computing—a 


Figure 1 | Topological cluster states. a, Elementary lattice cell. Dashed lines 
represent the edges of the associated cell complex and solid lines represent the 
edges of the interaction graph. Qubits (spheres) are encoded on the faces and 
edges of the elementary cell. b, Larger topological cluster state of 5 X 5 X T cells. 
Green dots represent local Z measurements, which effectively remove the 
measured qubits from the cluster state and thereby create a non-trivial topology 
capable of supporting a single correlation. Red dots represent Z errors. Red cells 
indicate the ends of error chains where Cz = —1. One axis of the cluster can be 
regarded as simulating the ‘circuit time’, t. The evolution of logical states from t, 
to tf, is achieved by performing local X measurements on all physical qubits 
between f, and f,. 
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measurement-based process—corrections suggested by TEC are not 
applied to the remaining cluster state but rather to the classical out- 
comes of X measurements. 


Simpler topological cluster state 

The cell complex in Fig. 1b encodes a propagating logical qubit in terms 
of one topological correlation, Cz, = 1, and is robust against a local Z 
error. However, it contains 25 elementary cells and 180 physical qubits 
for each layer of complex over a unit time span, which is beyond the 
capacity of available experimental techniques. We design a simpler 
graph state, |Gg) (Fig. 2a), to mimic the cell complex of Fig. 1b. 

The topological feature of |Gg) can be seen from its association with 
the 3D cell complex in Fig. 2b, which consists of four elementary 
volumes, {v, w, y; z}; six faces, {f, fr, fa fas fa fo}s two edges, {e7, eg}; 
and two vertices, {s, t}. All six faces have the same boundary, e; U eg, 
and any two of them form a closed surface, F. The centre volume is 
removed to resemble the defect in Fig. 1b, and the topological cor- 
relation to be protected, Cr,,, reads 


Cp, = (XsX6) =1 (1) 


In this simple cell complex, the topological correlation Cp, =1 is 
already multiply encoded: it is represented by any expectation 
(X;X;) with i © {1, 2, 5} andj € {3, 4, 6}. Moreover, there exist four 
other closed surfaces, corresponding to the respective boundaries of 
the volumes {v, w, y, z}, that do not enclose the defect. The ‘redundant’ 
topological correlations are 


(X1Xp) = (X2X5) = (X3Xo) = (X3X4) = 1 (2) 


These can be used as error syndromes in TEC, which makes one or 
more of them equal to — 1. As shown in Table 1, a single Z error on any 
physical qubit can be located and corrected. 

Therefore, from the aspect of TEC capability, the cluster state |Gg) 
is analogous to the cell complex in Fig. 1b. Each protects one topo- 
logical correlation and is robust against a single Z error, despite the 
cell complex in Fig. 2b being too small to propagate a logical qubit (see 
Supplementary Information for details). 


Figure 2 | Cluster state | Gg) and its cell complex. a, Gg, the interaction graph 
of |Gs). b, The corresponding 3D cell complex, with volumes {v, w, y, z}, faces 
{fis fa fa fas fo» fo} edges {e7, eg} and vertices {s, t}. The exterior and the centre 
volume are not in the complex. For better illustration, the cell complex is cut 
open and the foreground quarter is removed (silhouette view from right is 
shown for clarity). 


©2012 Macmillan Publishers Limited. All rights reserved 


Table 1 | Gg) and the syndromes (X;X)) 


Qubit with Z error (XX) (X2X5) (X3X6) (X3X4) 
1 =i] 1 il il 
2 = =] 1 1 
3 1 1 = =i 
4 1 1 1 =i 
5 1 =] 1 i 
6 1 1 =I 1 


Preparation of the eight-photon cluster state 


In our experiment, we create the desired eight-photon cluster state 
using spontaneous parametric down-conversion and linear optics. 
The first step is to develop an ultrabright, high-fidelity entangled- 
photon source. As shown in Fig. 3a, an ultraviolet mode-locked laser 
pulse (power, 915 mW) passes through a B-barium borate crystal, 
generating a pair of polarization-entangled photons in the state 
|$) =(|HH) +|VV))//2. Using an interferometric Bell-state syn- 
thesizer’, we guide photons of different bandwidths (Fig. 3a, red 
and blue dots, respectively) along separate paths. This disentangles 
the temporal information from the polarization information. By con- 
trast with the conventional narrowband filtering technique, this pro- 
cess does not result in photon loss and we thus achieve ultrahigh 
brightness. Four pairs of such entangled photons are prepared and 
labelled as 1-2, 3-4, 5-6 and 7-8 (Fig. 3b). Then we generate two 
graph states, each of four photons. The first is a Greenberger-Horne- 
Zeilinger state, (|H®*),_,+|V®*),_4) / V2, obtained by superpos- 
ing photons 2 and 4 on a polarizing beam splitter (PBS), which 
transmits horizontal polarization (H) and reflects vertical polarization 
(V). At the same time, photons 6 and 8 are interfered on a polariza- 
tion-dependent beam splitter (PDBS) and then separately pass 
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through two other PDBSs. The first PDBS has transmitting probabilities 
Ty = 1 and Ty=1/3, and the second and third have Ty = 1/3 and 
Ty = 1. The combination of these three PDBSs acts as a controlled- 
phase gate****. With a success probability of one-ninth, there is 
twofold coincidence in paths 6’ and 8’, yielding a four-photon 
cluster state™* [|HH)56(|HH)7s + | VV)7s) + | VV)s6(|HH)73 — | VV)zs)]/2. 
Finally, photons 4’ and 6’ are superposed on PBS. When eight 
photons come out of the output ports simultaneously, we obtain an 
entangled eight-photon cluster state: 


lp) = 
1 
2 
This is exactly the cluster state |Gg) shown in Fig. 2a under Hadamard 
operations H®® on all qubits. We note that the photons, which are 
interfered on the PBSs or the PDBS, have the same bandwidth, and that 
a star topology of the eight-photon interferometer* leads to an effec- 
tive noise reduction. 

To ensure good spatial and temporal overlap, the photons are 
also spectrally filtered, with full-widths at half-maximum of 
Adewum = 8nm for photons 1, 3, 5 and 7 and Adgwum = 2.8nm 
for photons 2, 4, 6 and 8, and are coupled by single-mode fibres. 
We obtain an average twofold coincidence count of ~3.4 x 10°s + 
and a visibility of ~94% in the {|H), | V)} basis as well as in the {|-+), 
|—)} basis, where | +) =(|H)+|V))/\/2. Fine adjustments of the 
delays between the different paths are tuned to ensure that all the 
interfering photons arrive at the PBSs and the PDBS simultaneously. 

Measurement of each photon is made using a polarization analyser, 
which contains a combination of a QWP, a HWP and a PBS together 


(3 
[[#2°), (IH) ne-+1VV)re) +1V2%),_ (IHF re —1VV)re) | 


Figure 3 | Experimental set-up for the generation of the eight-photon 
cluster state and the demonstration of topological error correction. 

a, Creation of ultrabright entangled-photon pairs. An ultraviolet laser pulse 
passes through a 2-mm, nonlinear B-barium borate crystal, creating an 
entangled photon pair {a, b} with density matrix 

p= (\H2)| Vé)(Vél(H2|+|V$)|He) (H| (Vs))/2 by parametric down- 
conversion, where o and e indicate ordinary and extraordinary polarizations, 
respectively perpendicular and parallel relative to the V-polarized pump. After 
both photons pass through compensators, which include a 45° half-wave plate 
(HWP) anda 1-mm f-barium borate crystal, one of the photons’ polarizations 
is rotated by another 45° HWP. Then we re-overlap the two photons on a PBS, 
creating an entangled photon pair in a state 


\ha») =(|H)|H) +e”|V)|V))@ lea) lov) //2, where |e,) is a state in which all 
photons in path a have extraordinary polarization and |o,) isa state in which all 
photons in path b have ordinary polarization. b, To create the desired cluster 
state, we combine photons from paths 6 and 8 at the first PDBS and let each 
photon pass through another PDBS (PDBS’), resulting a controlled-phase 
operation between the two photons. At the same time, photons 2 and 4 are 
interfered on PBS,. Then photons 4’ and 6’ are overlapped on PBS,. On 
coincidence detection, we create the eight-photon cluster state (equation (3)) 
for topological error correction. c, Polarization analyser for each individual 
photon, containing a quarter-wave plate (QWP), a HWP, a PBS and two single- 
mode, fibre-coupled single-photon detectors. 
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with a single-mode, fibre-coupled single-photon detectors in each 
output of the PBS (Fig. 3c). The complete set of all 256 possible 
combinations of eight-photon coincidence events is registered by a 
home-made programmable coincidence logic unit based on a field- 
programmable gate array. We obtain an eightfold coincidence rate of 
3.2 per hour. On the basis of the measurements for the 256 possible 
polarization combinations in the {|H), | V)} basis (Fig. 4a), we obtain a 
signal-to-noise ratio, defined as the ratio of the average number of 
desired components to that of non-desired components, of about 
200:1. This indicates that we have been successful in preparing the 
desired eight-photon cluster state. 

To characterize the cluster state more precisely, we use the entan- 
glement witness method to determine its fidelity. For this purpose, we 
construct a witness that allows for the lower bound on the state fidelity 


and requires only eight measurement settings (Supplementary 
Information): 


Ws= 5 — (Wy) Wl=lv'y(v'D 


= F (|H)(HIS°—|V)(V|°),_ .@ (XrXa— Yr¥e) 
+4 (do-uiae*) @(|H)(HI®?—-|V)(VI9"),5 
k=0 1-6 


Here (i/'|/) = 0 and M; = cos (kn/6)X + sin (km/6)Y. The measured 
expectation value of each measurement setting in Ws is shown in 
Fig. 4b. These yield the witness (Ws) = —0.105 + 0.023, which is nega- 
tive by 4.5 s.d. The state fidelity is F > 0.5 — (Wg) =0.605 + 0.023. This 
confirmed the presence of genuine eight-photon entanglement. 


Experimental topological error correction 

Given such a cluster state, topological error correction is implemented 
using a series of single-qubit measurements and classical correction 
operations. In the laboratory, operations are performed on state |\) 
(equation (3)), which differs from |Gg) in Fig. 2a by the Hadamard 
operation H®®*. Therefore, the correlation to be protected in equation 
(1), (X5Xe), corresponds to (Z;Z,) in the experiment; similarly, each 
(X;X;) in equation (2) corresponds to (Z;Z)). Furthermore, X errors are 
simulated instead of Z errors. 


0 h) 
N 
fo) 


for) 
oO 


Eightfold coincidences (8! 
wo 


Figure 4 | Experimental results for the created eight-photon cluster state. 
a, Measured eightfold coincidence in the {|H), V} basis. b, The expectation 
values for different witness measurement settings. The measurement settings 
are Ag = (|H)(H|®° — | VXV|®°),_6X7Xe, Ai = (|HXH| °° — | VXV|® 1-6 
Y7Yz and B; = M®*(|HH|®? — | V)(V|®?)z3 with i = 0, ..., 5. The 
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In the experiment, the noisy quantum channels on polarization 
qubits are simulated by one HWP positioned between two QWPs, 
which are set at 90° relative to the horizontal. By randomly setting 
the HWP axis to be oriented at +0 with respect to the horizontal 
direction, the noisy quantum channel can be simulated with a bit-flip 
error probability of p = sin?(20). 

We first study the case in which only a single X error occurs on one 
of the six photons {1, ..., 6}. The syndrome correlations are measured 
(Fig. 5). For comparison, in Fig. 4c we plot the correlations without 
any simulated error. This comparison, together with Table 1, makes it 
possible to locate precisely the physical qubit undergoing an X error. 

We then consider the case in which all six photons are simulta- 
neously subject to a random X error with equal probability 0 < p< 1 
and study the rate of errors, (Z;Z6) = —1, for the topological quantum 
correlation (Z,Z,). Without error correction, the error rate of correla- 
tion(Z;Z.) isP = 1— (1 — py al p- With error correction, the residual 
error becomes 


P=1—[(1—p)°+p*] — [6p(1—p)" 4 
[9p°(1—p)* + 9(1—p)’P*] 
For small p, the residual error rate after error correction is significantly 
reduced relative to the unprotected case. As shown in Fig. 6, the experi- 
mental results are in good agreement with these theoretical predic- 
tions. Considerable improvement of the robustness of the correlation 
(ZsZe) can be seen both in theory and in practice. 

In the experiment, the whole measurement takes about 80 days. 
This requires our set-up to be extremely stable. The imperfections in 
the experiment are mainly due to the undesired components in the 
{|H), |V)} basis, which arise from higher-order emissions of entangled 
photons, and the imperfect photon overlapping at the PBSs and the 


PDBS. In spite of these issues, the viability of TEC is successfully 
demonstrated in the experiment. 


6(1—p)p"] 


Discussion 


In this work, we have experimentally demonstrated TEC with an 
eight-photon cluster state. This state represents the current state of 
the art for preparation of cluster states in qubit systems and is of 
particular interest in studying multipartite entanglement and 
quantum information processing. The scalable construction of cluster 
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error. Error bars, 1 s.d., deduced from propagated Poissonian counting 
statistics of the raw detection events. 
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Figure 5 | Experimental results of syndrome correlations for topological error correction. Only one qubit is subjected to an X error in each plot. The 
measurement for each error setting takes about 80h. Error bars, 1 s.d., deduced from propagated Poissonian counting statistics of the raw detection events. 


states in future will require further development of high-efficiency 
entanglement sources and single-photon detectors**. Recent results 
have shown that if the product of the number-resolving detector 
efficiency and the source efficiency is greater than two-thirds, efficient 
linear optical quantum computation is possible**. There has been 
technical progress towards this goal, such as deterministic, storable, 
single-photon sources” and photon-number-resolving detectors”. 
Our demonstration of TEC is a further step towards fault-tolerant 
quantum computation. In the scheme, given sufficient qubits and 
physical error rates below 0.7-1.1%, arbitrary quantum computations 
can be performed arbitrarily reliably. The high threshold error rate is 
especially remarkable given that only nearest-neighbour interactions 
are required. Owing to these advantages, TEC is especially well suited 
for physical systems geometrically constrained to nearest-neighbour 
interactions, such as quantum dots”, Josephson junction qubits”, ion 
traps*', cold atoms in optical lattices** and photonic modules’’”. A 
quantum gate with an error rate below the threshold required in 
TEC is within reach of present technology*’. It would be interesting 
in future work to exploit cluster states of the maximum achievable 
size, to implement topologically error-protected quantum algorithms 
using local measurements. 
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Figure 6 | Experimental results of topological error correction. All physical 
qubits are simultaneously subject to an X error with equal probability ranging 
from 0 to 1. The blue circles and blue lines represent the experimental and, 
respectively, theoretical values of the error rate for the protected correlation 
without TEC, and the red squares and red lines similarly represent the error rate 
with TEC. The agreement between the experimental and the theoretical results 
demonstrates the viability of TEC. The measurement of each data point takes 
80h. Error bars, 1 s.d., deduced from propagated Poissonian counting statistics 
of the raw detection events. 
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Ubiquitin-dependent regulation of COPII 
coat size and function 


Lingyan Jin'*, Kanika Bajaj Pahuja>?*, Katherine E. Wickliffe’, Amita Gorur’’, Christine Baumgartel!, Randy Schekman!? 


& Michael Rape! 


Packaging of proteins from the endoplasmic reticulum into COPII vesicles is essential for secretion. In cells, most COPII 
vesicles are approximately 60-80 nm in diameter, yet some must increase their size to accommodate 300-400 nm 
procollagen fibres or chylomicrons. Impaired COPII function results in collagen deposition defects, cranio- 
lenticulo-sutural dysplasia, or chylomicron retention disease, but mechanisms to enlarge COPII coats have remained 
elusive. Here, we identified the ubiquitin ligase CUL3-KLHLI2 as a regulator of COPII coat formation. CUL3-KLHL12 
catalyses the monoubiquitylation of the COPII-component SEC31 and drives the assembly of large COPII coats. As a 
result, ubiquitylation by CUL3-KLHLI2 is essential for collagen export, yet less important for the transport of small 
cargo. We conclude that monoubiquitylation controls the size and function of a vesicle coat. 


The extracellular matrix provides a scaffold for cell attachment and 
binding sites for membrane receptors, such as integrins, making it 
essential for the development of all metazoans'’. When engaged with 
the extracellular matrix, integrins trigger signalling cascades that 
regulate cell morphology and division, yet in the absence of a func- 
tional extracellular matrix, integrins are removed from the plasma 
membrane by endocytosis’. The proper interplay between integrins 
and the extracellular matrix is particularly important during early 
development’, as stem cells depend on integrin-dependent signalling 
for division and survival’. 

The establishment of the extracellular matrix requires secretion of 
several proteins, including its major constituent collagen. Following 
its synthesis in the endoplasmic reticulum, the export of collagen from 
cells depends on COPII vesicles*°, and mutations in genes encoding 
COPII proteins lead to collagen deposition defects, skeletal aberra- 
tions and developmental diseases, such as cranio-lenticulo-sutural 
dysplasia’®”’. 

COPII vesicles are surrounded by a coat consisting of the SARI 
GTPase, SEC23-SEC24 adaptors, and an outer layer of SEC13-SEC31 
heterotetramers'’. These coat proteins self-assemble into cuboctahedral 
structures with a diameter of approximately 60-80 nm, which are too 
small to accommodate a procollagen fibre with a length of 300- 
400 nm'*"*. Thus, collagen transport in cells must involve factors that 
are absent from in vitro self-assembly reactions. Indeed, TANGO] (also 
known as MIA3) and its partner cTAGES interact with collagen and 
SEC23-SEC24, thereby recruiting collagen to nascent COPII coats'*””. 
The deletion of Tango1 in mice resulted in collagen deposition defects 
similar to those caused by loss of COPII’*, and mutations in human 
TANGO] are associated with premature myocardial infarction”. 
However, TANGO] is not known to regulate the size of COPII coats 
and mechanisms that permit the COPII coat to accommodate a large 
cargo remain poorly understood. 

By analysing mouse embryonic stem (ES) cell division, we have 
identified CUL3-KLHL12 as a regulator of COPII coat formation. 
CUL3-KLHL12 monoubiquitylates SEC31 and drives assembly of large 
COPII coats. As a result, ubiquitylation by CUL3-KLHL12 is essential 
for collagen export, a step that is required for integrin-dependent mouse 


ES cell division. We conclude that monoubiquitylation determines the 
size and function of a vesicle coat. 


CUL3 regulates mouse ES cell morphology 

To provide insight into stem cell-specific division networks, we 
depleted ubiquitylation enzymes from mouse ES cells and scored 
for effects on proliferation and morphology. We found that loss of 
the ubiquitin ligase CUL3 caused mouse ES cells to form tightly 
packed cell clusters with prominent actin cables and aberrant adhesions, 
as seen by confocal microscopy analysis of actin and vinculin local- 
ization (Fig. la). A similar phenotype was observed upon depletion 
of UBA3, a component of the NEDD8 pathway that activates CUL3 
(Supplementary Fig. la). CUL3-depleted mouse ES cells were 
delayed in proliferation (Supplementary Fig. 1b, d), yet retained their 
pluripotency, as seen by OCT4- and alkaline phosphatase-staining and 
the absence of differentiation markers in expression analyses (Sup- 
plementary Figs 1c, e, fand 2b). In contrast to mouse ES cells, depletion 
of CUL3 had weaker consequences in fibroblasts (Fig. 1a), although a 
previously reported increase in multinucleation was observed (Sup- 
plementary Fig. 1g; ref. 20). 

Several observations show that the mouse ES cell phenotypes were 
caused by specific depletion of CUL3. First, several short interfering 
RNAs targeting distinct regions of the Cul3 messenger RNA had the 
same effects on mouse ES cells, with a close correlation between 
knockdown efficiency and strength of phenotype (Supplementary 
Fig. 2a). Second, microarray analysis showed a strong reduction in 
Cul3 mRNA upon siRNA treatment, whereas no other gene was sig- 
nificantly and reproducibly affected (Supplementary Fig. 2b). Third, 
siRNAs that target closely related proteins, such as other cullins, did 
not disturb the morphology of mouse ES cells (Supplementary Fig. 2c). 

The aberrant morphology of CUL3-depleted mouse ES cells was 
reminiscent of increased RhoA GTPase activity, which triggers actin 
filament bundling*’. Accordingly, a reduction in RhoA levels or inhibi- 
tion of the RhoA effector kinase ROCK1 rescued CUL3-depleted mouse 
ES cells from compaction (Supplementary Fig. 3a). Among several 
possibilities, higher RhoA activity in the absence of CUL3 could result 
from RhoA stabilization or defective integrin signalling. Stabilization 
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Figure 1 | CUL3 regulates mouse ES cell morphology. a, Left, D3 mouse ES 
cells were plated on gelatin and transfected with siRNAs targeting Cul3 
(siCul3), which resulted in cell clustering (phase microscopy; upper panel) and 
compaction (confocal microscopy: vinculin, green; actin, red; DNA, blue). 
Right, Depletion of CUL3 from mouse 3T3 fibroblasts did not cause cell 
compaction. Phase images original magnification was X10, fluorescence 
images X40. b, CUL3 is required for integrin localization to the mouse ES cell 
plasma membrane. D3 mouse ES cells were plated on gelatin (top two rows), 
growth-factor-depleted Matrigel or collagen IV. Following CUL3 depletion, cell 
compaction and integrin-targeting to the plasma membrane were analysed by 
confocal microscopy (actin, red; integrin B1, green; DNA, blue). Original 
magnification 40. 


of RhoA by co-depletion of all RhoA-specific CUL3 adaptors, the 
BACURDs”, did not affect mouse ES cell morphology (data not 
shown). By contrast, depletion of components of integrin signalling 
pathways phenocopied the loss of CUL3 in mouse ES cells (Sup- 
plementary Fig. 3b); partial reduction in CUL3 levels showed synthetic 
lethality with dasatinib, an inhibitor of the SRC kinase that acts down- 
stream of integrin activation (Supplementary Fig. 3c); and integrin B1 
was absent from the plasma membrane of CUL3-depleted mouse ES 
cells (Fig. 1b). 

CUL3 could regulate integrin synthesis and trafficking, or it could 
allow for efficient deposition of extracellular matrix proteins to prevent 
integrin internalization’. To distinguish between these possibilities, we 
grew mouse ES cells on growth-factor-depleted Matrigel to provide an 
exogenous extracellular matrix. Strikingly, under these conditions, 
integrin B1 was found at the plasma membrane of CUL3-depleted 
mouse ES cells and no cell clustering was observed (Fig. 1b). Thus, 
CUL3 controls integrin signalling in mouse ES cells, most likely by 
supporting the establishment of a functional extracellular matrix. 
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KLHL12 is a key CUL3 adaptor in mouse ES cells 


CUL3 recruits substrates through adaptors with BTB domains”, yet 
siRNA approaches did not yield roles for BTB proteins in ES cells. As 
an alternative strategy to isolate CUL3 adaptors, we made use of the 
observation that stem cell regulators are highly expressed in ES cells, 
but downregulated upon differentiation”. Using affinity purification 
and mass spectrometry, we identified 31 BTB proteins that interact 
with CUL3 in mouse ES cells (Supplementary Fig. 4a; Supplementary 
Table 1). When analysed by quantitative polymerase chain reaction 
with reverse transcription (qRT-PCR) and immunoblot, we found 
that three adaptors, KLHL12, KBTBD8 and IBTK, were highly 
expressed in mouse ES cells, but downregulated upon differentiation 
(Fig. 2a, b and Supplementary Fig. 3d). Next, we depleted these adaptors 
from mouse ES cells that were sensitized for changes in integrin sig- 
nalling by treatment with dasatinib. Importantly, depletion of KLHL12, 
but no other BTB protein, resulted in mouse ES cell compaction, as 
seen with loss of CUL3 (Fig. 2c). Accordingly, endogenous KLHL12 
effectively binds CUL3 in mouse ES cells (Supplementary Fig. 4b). 
These experiments, therefore, identify KLHL12 as a key substrate- 
adaptor for CUL3 in mouse ES cells and the CUL3-KLHL12 ubiquitin 
ligase as an important regulator of mouse ES cell morphology. 


23-26 


CUL3 monoubiquitylates SEC31 


To isolate the substrates of CUL3—KLHL12, we constructed 293T cell 
lines that allowed for the inducible expression of Flag-KLHL12. By 
affinity chromatography and mass spectrometry, we identified the 
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Figure 2 | KLHL12 is a substrate adaptor for CUL3 in mouse ES cells. a, D3 
mouse ES cells were subjected to differentiation, and mRNA levels of indicated 
proteins were measured by qRT-PCR. EB, embryoid bodies. b, KLHL12 protein 
is downregulated upon differentiation, as observed by immunoblot of above 
samples. c, KLHL12 is a critical CUL3-adaptor in mouse ES cells. D3 mouse ES 
cells were sensitized towards altered integrin-signalling with dasatinib and 
monitored for compaction by phase (upper panel) or confocal microscopy 
(actin, red; vinculin, green; DNA, blue). Original magnification < 40. 
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COPII proteins SEC13 and SEC31 as specific binding partners of 
KLHL12 (Fig. 3a and Supplementary Table 2). Immunoblotting con- 
firmed retention of endogenous SEC13 and SEC31 in KLHL12 purifi- 
cations, but not in precipitates of other BTB proteins (Supplementary 
Fig. 5a). As seen in pull-down assays, KLHL12 directly bound SEC31, 
but not SEC13 (Supplementary Fig. 5c, d), and this interaction was 
mediated by the amino terminus of SEC31 (Supplementary Fig. 6a) 
and the Kelch domain of KLHL12 (Supplementary Fig. 6b). In cells, 
approximately 30% of endogenous KLHL12 was associated with 
SEC13-SEC31 (Fig. 3b and Supplementary Fig. 5b). Consistent with 
such a prominent interaction, SEC13-SEC31 and KLHL12 colocalized 
in punctae, which are likely to represent endoplasmic reticulum exit 
sites of COPII vesicles (Fig. 3c;*). Importantly, siRNAs that compromise 
COPII resulted in mouse ES cell compaction (Fig. 3d), indicating that 
CUL3-KLHL12 and the COPII coat act in the same pathway. 

In vitro, CUL3-KLHL12 catalysed the monoubiquitylation of 
SEC31 (Fig. 3e), which was not observed if a KLHL12 mutant with 
a defective SEC31-binding interface was used (Fig. 3f). SEC31 was also 
monoubiquitylated in cells, which was strongly increased upon 
expression of KLHL12 (Fig. 3g). KLHL12 mutants unable to bind 
SEC31 abolished its monoubiquitylation (Fig. 3h), which is likely to 
be due to dimerization with and inactivation of endogenous KLHL12 
(Fig. 3a and Supplementary Fig. 6c). SEC31 monoubiquitylation was 
also strongly diminished upon expression of dominant-negative 
CUL3 (Fig. 3g) or depletion of CUL3-KLHL12 by siRNA (Fig. 3i). 
As seen upon expression of lysine-free ubiquitin, SEC31 was mono- 
ubiquitylated at one preferred and an alternative, less prominently 
used lysine (Fig. 3g), consistent with proteomic analyses that iden- 
tified Lys 647 and Lys 1217 in SEC31A as ubiquitylation sites”°°. 
However, neither mutation of these residues nor any other of the 65 
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lysine residues of SEC31 blocked ubiquitylation by CUL3-KLHL12 
(data not shown), revealing flexibility in the actual modification site. 

Co-expression of KLHL12 and CUL3 triggered SEC31 multiubi- 
quitylation and degradation (Figs 3g, 4e and Supplementary Fig. 6d), 
which was not observed with lysine-free ubiquitin (Fig. 3g). However, 
whereas SEC31 was monoubiquitylated by endogenous CUL3- 
KLHL12, its multiubiquitylation was only seen when CUL3 and 
KLHL12 were overexpressed. Depletion of CUL3-KLHL12 or protea- 
some inhibition did not change SEC31 levels in untransfected cells 
(Fig. 3i and Supplementary Fig. 6e), and blockade of ubiquitin chain 
formation or proteasome inhibition did not impair CUL3-KLHL12 
function (see Fig. 5). Thus, multiubiquitylation of SEC31 is unlikely a 
key outcome of CUL3-KLHL12 activity in mouse ES cells. Instead, it 
seems that CUL3-KLHL12 acts by catalysing monoubiquitylation, 
with the COPII protein SEC31 as a major substrate. 


CUL3 regulates the size of COPII coats 

To identify a role for monoubiquitylation by CUL3-KLHL12, we 
induced KLHL12 expression in cells and followed the fate of SEC31 
by microscopy. Shortly after KLHL12 induction, the majority of 
KLHL12 and SEC31 colocalized in small punctae (Fig. 4a). Over time, 
these punctae grew into much larger structures that contained most of 
SEC31, as well as other COPII components, such as SEC13 or SEC24C 
(Fig. 4a, b). As seen by high-resolution confocal imaging, the large 
structures were hollow and spherical with a diameter of 200-500 nm, 
and they were decorated with the proteins of the COPII coat and 
with KLHL12 (Fig. 4c). Accordingly, thin-section electron microscopy 
revealed large, crescent-shaped tubules, possibly of endoplasmic reticu- 
lum origin, in cells transfected with KLHL12 (Fig. 4d). Immunogold- 
labelling electron microscopy showed comparable structures of 
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Figure 3 | CUL3-KLHL12 monoubiquitylates SEC31. 

a, Immunoprecipitates of Flag-KLHL12 or Flag—K1hl9 were analysed by silver 
staining and mass spectrometry. Asterisk, non-specific band; double asterisk, 
breakdown product of KLHL12. b, SEC13 was immunoprecipitated from HeLa 
cell lysates, and SEC31 and KLHL12 were detected by immunoblot. c, KLHL12 
colocalizes with COPII, as seen by confocal microscopy (KLHL12, green; 
SEC13, red; DNA, blue). Original magnification x60. d, D3 mouse ES cells 
grown on gelatin and depleted of SEC13 were analysed for compaction by phase 
(top) or confocal microscopy (actin, red; vinculin, green; DNA, blue). Original 
magnification 40. e, CUL3-KLHL12 monoubiquitylates SEC31. CUL3- 
NEDD8-RBX1 was incubated with KLHL12, SEC13/31 and ubiquitin (ubi) or 
His-ubiquitin (His-ubi). f, In vitro ubiquitylation of SEC31 by CUL3-KLHL12 


or CUL3-KLHL12(FG289AA) (FG289AA) was performed as above. g, SEC31 
is monoubiquitylated in vivo. Upper panels, ubiquitin conjugates were purified 
under denaturing conditions from MG132-treated 293T cells expressing His— 
ubiquitin, haemagglutinin-SEC31, KLHL12, CUL3 or dominant-negative 
CUL3 (dnCUL3), and analysed by anti-SEC31 Western blot. Lower panels, the 
same experiment was performed with lysine-free His—ubiquitin, which only 
allowed SEC31-monoubiquitylation on at least two sites (SEC31-ubi and 
SEC31-ubi*). SEC31-ubi-n denotes multiubiquitylated SEC31. h, Ubiquitin 
conjugates were purified from 293T cells expressing KLHL12 or SEC31- 
binding deficient KLHL12 mutants. i, CUL3 is essential for SEC31 
ubiquitylation in vivo. 293T cells were transfected with His—ubiquitin and 
siRNAs, and ubiquitin conjugates were analysed for SEC31 by Western blot. 
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Figure 4 | CUL3-KLHL12-dependent 
monoubiquitylation enlarges COPII-structures. 
a, Localization of doxycycline (dox)-induced Flag- 
KLHL12 (dox::K/h112, green) and SEC31 (red) in 
293T cells, monitored by confocal microscopy. 
Scale bar, 3 pm. b, KLHL12-expressing HeLa cells 
were analysed for KLHL12 (green) and SEC31, 
SEC13 or SEC24C (red) by confocal microscopy. 
Scale bar, 3 um. c. COPII-structures in HeLa cells 
transfected with Flag-KLHL12, lysine-free 
ubiquitin (ubi-KO) or Cul3-siRNA, analysed by 
confocal microscopy. Scale bar, 500 nm. d, Upper 
panel, thin-section electron microscopy (EM) of 
KLHL12-expressing or control HeLa cells (red 
arrow, KLHL12-dependent structures; blue arrow, 
small control vesicles). Scale bar, 500 nm. Lower 
panel, immunogold-EM of KLHL12 in transiently 
transfected HeLa (left) or stable 293T cells (right). 
Scale bar, 200 nm. e, HeLa cells transfected with 
Flag-KLHL12, lysine-free ubiquitin, Flag—- 
KLHL12(FG289AA), Flag—CUL3(1-250) or Flag— 
CUL3 were analysed for localization of KLHL12/ 
CUL3 (green) and SEC31 (red) by confocal 
microscopy. Original magnification 40. 
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200-500 nm that were decorated with KLHL12 (Fig. 4d). The KLHL12- 
dependent structures neither contained a cis-Golgi protein; ERGIC-53, 
which is absent from procollagen transport vesicles*'; endoplasmic 
reticulum membrane markers that do not accumulate at endoplasmic 
reticulum exit sites**; nor endosomal or autophagosomal markers 
(Supplementary Fig. 7a-c). Importantly, SEC31-binding deficient 
mutants, including KLHL12(FG289AA), neither colocalized with 
SEC31 nor induced formation of large structures (Fig. 4e and Sup- 
plementary Fig. 7d), and depletion of SEC31 blocked formation of large 
structures by KLHL12 (Fig. 4b). Thus, binding of KLHL12 to SEC31 
triggers formation of large COPII-containing structures. 

When KLHL12 was expressed with a CUL3 mutant that blocks 
SEC31 ubiquitylation (CUL3(1-250)), COPII structures were not 
enlarged (Fig. 4e). In addition, depletion of CUL3 by siRNAs, which 
also abolishes SEC31 monoubiquitylation, prevented formation of 
large COPII structures by KLHL12 (Fig. 4a, c). By contrast, if 
KLHL12 was expressed with lysine-free ubiquitin to allow mono-, 
but not multiubiquitylation, large COPII structures were readily 
detected (Fig. 4c, e), and these structures were enriched for ubiquitin, 
consistent with monoubiquitylation being non-proteolytic (Sup- 
plementary Fig. 7e). Thus, monoubiquitylation by CUL3-KLHL12 
promotes formation of large COPII structures, which probably rep- 
resent a mixture of nascent coats at endoplasmic reticulum exit sites 
and budded coats on large COPII vesicles or tubules. 


498 | NATURE VOL 482 | 23 FEBRUARY 2012 


CUL3 is required for collagen export 


Our screen linked CUL3-KLHL12 to the establishment of the stem cell 
extracellular matrix, which requires collagen secretion. Thus, the 
CUL3-KLHL12-dependent increase in COPII size might function to 
promote collagen export from the endoplasmic reticulum. To test this 
hypothesis, we expressed KLHL12 in IMR90 cells, which at steady state 
accumulate collagen in the endoplasmic reticulum due to inefficient 
export. Strikingly, KLHL12, but not KLHL12(FG289AA) or unrelated 
BTB proteins, triggered depletion of procollagen I from intracellular 
endoplasmic reticulum pools (Fig. 5a). As a result, increased collagen 
levels were detected in the supernatant of cells expressing KLHL12, but 
not KLHL12(FG289AA) (Fig. 5b). When secretion was inhibited with 
brefeldin A, or if collagen folding in the endoplasmic reticulum was 
impaired by removal of ascorbate from the medium, procollagen 
remained within KLHL12-expressing cells (Fig. 5a). Time-resolved 
experiments showed that KLHL12 strongly accelerated collagen export 
from IMR90 cells (Fig. 5c). Shortly after inducing secretion, KLHL12 
and collagen were detected at overlapping locations (Supplementary 
Fig. 7f), all of which indicates that CUL3-KLHL12 facilitates collagen 
traffic from the endoplasmic reticulum. 

Blockade of SEC31 ubiquitylation by dominant-negative CUL3 
interfered with the KLHL12-dependent export of collagen from 
IMR90 cells (Supplementary Fig. 8a). Similarly, depletion of CUL3- 
KLHL12 from engineered HT1080 fibrosarcoma cells severely 
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Figure 5 | CUL3-KLHL12 promotes collagen export. a, IMR90 cells 
transfected with Flag-KLHL12, Flag-KLHL12(FG289AA) or Flag-KEAP1 
were analysed by confocal microscopy (BTB, green; collagen-I, red; DNA, 
blue). When noted, cells were treated with chloroquine, MG132, brefeldin A 
(BFA) or dialysed medium lacking ascorbate. Errors bars, standard deviation 
n = 3. Original magnification 60. b, Cell lysate (L) or culture medium (M) of 
IMR90 cells transfected with Flag-KLHL12 or Flag-KLHL12(FG289AA) was 
analysed by immunoblotting. c, Collagen I localization was analysed in IMR90 


impaired collagen export, and most cells retained high levels of collagen 
in their endoplasmic reticulum (Fig. 5d and Supplementary Fig. 8b). In 
contrast, smaller COPII cargoes, such as fibronectin or EGF receptor, 
were properly localized in the absence of CUL3 (Supplementary Fig. 
8c, d). Similar observations were made in mouse ES cells, where deple- 
tion of CUL3 led to a strong intracellular accumulation of collagen IV, 
comparable to the effects observed upon loss of SEC13 (Fig. 5e and 
Supplementary Fig. 8e). Thus, CUL3-KLHL12 is required for collagen 
export, whereas it is less important for the trafficking of smaller COPII 
cargo. 

If promoting collagen export were the key role of CUL3 in mouse 
ES cells, the phenotypes of CUL3 depletion might be mitigated by 
addition of collagen in trans. Indeed, this was the case: when mouse ES 
cells were plated on purified collagen IV, depletion of CUL3 did not 
cause cell clustering, and integrin B1 was detected at the plasma 
membrane (Fig. 1b). We conclude that promoting collagen secretion 
is a key a function of CUL3, in agreement with its role in driving the 
assembly of large COPII coats. 


Discussion 

In this study, we have identified CUL3-KLHL12 as an essential regu- 
lator of collagen export, which is required for mouse ES cell division. 
Deletion of Cul3 in mice results in early embryonic lethality with 
completely disorganized extraembryonic tissues**, a phenotype that 
can in part be attributed to its role in collagen secretion. Moreover, 
KLHL12 has been identified as an autoantigen in the connective tissue 
disorder Sjogren’s syndrome™, raising the possibility that aberrant 
function of CUL3-KLHL12 might be related to disease. 


cells expressing KLHL12, after re-addition of ascorbate. Original magnification 
x40. d, HT1080 cells stably expressing collagen I were transfected with 
shRNAs against Cul3 and analysed by confocal microscopy (transfection 
control green fluorescent protein (GFP), green; protein disulphide isomerase 
(PDI), blue; collagen I, red). Error bars, standard deviation n = 3. Original 
magnification X60. e, D3 mouse ES cells were treated with control siRNAs or 
siRNAs targeting Cul3 or Sec13 and analysed by confocal microscopy (collagen 
IV, green; actin, red; DNA, blue). Original magnification x 40. 


CUL3-KLHL12 monoubiquitylates SEC31 and promotes forma- 
tion of large COPII coats that can accommodate unusually shaped 
cargo. As a result, CUL3 is essential for the secretion of procollagen 
fibres, whereas it is not required for the transport of smaller or more 
flexible molecules, such as fibronectin, EGF receptor or integrin B1. 
Thus, CUL3-KLHL12 seems to be specifically required for the 
COPII-dependent transport of large cargo. 

How ubiquitylation affects COPII coat size or structure is not 
known. None of the 65 lysine residues of SEC31 was essential for 
ubiquitylation by CUL3-KLHL12, showing that CUL3 can target 
alternative lysine residues if the primary site is blocked. Despite this 
flexibility, CUL3-KLHL12 does not stoichiometrically ubiquitylate 
SEC31. Thus, if SEC31 ubiquitylation performs a structural role, then 
few ubiquitylated molecules must suffice to produce large COPII 
coats, and these vesicles must tolerate considerable variation in the 
modification site. Alternatively, as often seen with monoubiquitylated 
proteins, modified SEC31 might recruit an effector that delays COPII 
budding or promotes coat polymerization. As CUL3-KLHL12 ubi- 
quitylates other proteins**, SEC31 may not be its only substrate in the 
secretory pathway. Identification of the complete set of CUL3- 
KLHL12 substrates and potential effector molecules should reveal 
the mechanism underlying the ubiquitin-dependent regulation of 
vesicle size. 

Our findings have the potential to be translated into therapeutic 
strategies. We envision that agonists of CUL3-KLHL12 function mitigate 
consequences of Sec23A mutations in cranio-lenticulo-sutural dysplasia 
or Sarl mutations in chylomicron retention disease’®"'. By contrast, 
interfering with CUL3 activity may counteract increased collagen 
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deposition during fibrosis or keloid formation*®. Given the strong 
clustering phenotypes observed in CUL3-depleted mouse ES cells, 
inhibition of CUL3-KLHL12 might impair the proliferation of 
metastatic cells, which display features of undifferentiated cells*”**. 
Thus, our identification of CUL3-KLHL12 as a regulator of COPII size 
and function provides an exciting starting point to understand and 


therapeutically exploit key events in protein trafficking. 


METHODS SUMMARY 


For stem cell culture, mouse D3 ES cells were maintained in GIBCO Dulbecco’s 
Modified Eagle ES cell medium containing 15% FBS, 1X sodium pyruvate, 1X 
non-essential amino acids, 1 mM B-mercaptoethanol and 1,000 U ml™ Meukaemia 
inhibitory factor (Millipore), and grown on gelatin-coated culture plates. 
Doxycycline-inducible 293T Trex Flag-BTB stable cell lines were made with the 
Flp-In T-REx 293 Cell Line system (Invitrogen) and maintained with blasticidin 
and hydromycin B. 

For screening, two siRNA oligonucleotides were designed against 40 mouse 
ubiquitin ligases (Qiagen). siRNA oligonucleotides (10 pmol) and Lipofectamine 
2000 were pre-incubated in a gelatin-coated 96-well plate. D3 mouse ES cells were 
seeded at 15,000 cells per well on top of the siRNA mixture, and the morphology of 
ES cell colonies was examined by bright-field microscopy 48 h after transfection. 

To identify CUL3-KLHL12 substrates, doxycycline-inducible 239T cell lines 
expressing Flag~-KLHL12 or Flag~KLHL9 were induced for 48 h. Cleared lysate 
was subjected to anti-Flag M2 affinity gel (Sigma), and precipitations were eluted 
with 3XFlag peptide (Sigma). Concentrated eluates were analysed by SDS- 
PAGE, and specific bands were identified by mass spectrometry analysis by the 
Vincent J. Coates Proteomics/Mass Spectrometry Laboratory. 

For in vitro ubiquitylation reactions, CUL3/RBX1 purified from Sf9 cells was 
conjugated to NEDD8 using recombinant APPBP1-UBA3, UBC12 (also known 
as UBE2M) and NEDD8. KLHL12 purified from Escherichia coli and SEC31A- 
SEC13 complexes from Sf9 cells were added together with energy mix, E1, 
UBCH5SC (also known as UBE2D3) and ubiquitin and incubated at 30 °C for 1h. 

For confocal microscopy, cells fixed in paraformaldehyde and permeabilized 
with Triton X-100 were incubated with primary antibodies for 2h and Alexa- 
labelled secondary antibodies (Invitrogen) for 1 h. Pictures were taken on Zeiss 
LSM 510 and 710 confocal microscopes and analysed with LSM image browser 
and Imaris 3D imaging processing software. Images were processed for contrast 
enhancement to remove noise. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 


Plasmids, protein, antibodies. Human Cul3 and KIhI12 were cloned into pcDNA4 
and pcDNAS vectors for expression in mammalian cells. Cul3, Sec31A and Sec13 
were also cloned into pCS2 vector for IVT/T and expression in mammalian cells. 
pcDNA4-Cul3‘” contains the first cullin repeat of the N-terminal CUL3 ( amino 
acids 1-250) which is sufficient for binding BTB proteins, but not RBX1 and serves 
as a dominant negative for CUL3/BTB-mediated ubiquitylation. The KLHL12 
mutants FG289AA, RL342AA, RGL369AAA, RE416AA, YDG434AAA and 
RCY510AAA were made by site-directed mutagenesis. 

CUL3 and RBX1 were cloned into pFastBac, co-expressed in Sf9 ES insect cells 
using the Bac-to-Bac baculovirus expression system (Invitrogen) and purified as a 
complex by Ni-NTA agarose (Qiagen). Similarly, the SEC31 A-SEC13 heterodimer 
and UBAI were purified from Sf9 ES insect cells. UbcH5c and Ubc12 were cloned 
into pQE vector and purified from BL21(DE3) bacterial cells. Ubiquitin was cloned 
into pET and pCS2 vector with a N-terminal 6X His tag. The pET-His-ubiquitin 
was used for bacterial purification whereas pCS2-His-ubiquitin was expressed in 
mammalian cells. Wild-type ubiquitin, APPBP1-UBA3 and NEDD8 were pur- 
chased from Boston Biochem. 

To purify recombinant KLHL12 for ubiquitylation assays, we expressed 
pMAL-TEV-KLHL12-his and pMAL-TEV-KLHL12°?""“-his in BL21(DE3) 
cells, purified the proteins on amylose resin, cleaved them by TEV protease, 
and re-purified them on Ni-NTA agarose. Wild-type K/hl12 and mutants were 
also cloned into pMAL vector and purified as maltose-binding protein (MBP)- 
tagged proteins for in-vitro protein binding assays. 

All shRNAs were cloned in pSuper-GFP neo vector (from Oligoengine) into 
BglII and Xho sites. The GFP-BCL2-CYB5 construct, a fusion between Bcl2 and 
cytochrome b5, was purchased from Clontech. 

We raised mouse monoclonal antibodies against human KLHL12 and human 
KLHL13. Both antibodies are available at Promab Biotechnologies (catalogue nos 
30058 and 30067). We also raised antibodies against SEC13, SEC24C and 
SEC24D. Other antibodies used in this study are: CUL3 (Bethyl Laboratories, 
catalogue no. A301-109A), SEC31A (BD Biosciences, catalogue no. 612350), 
collagen IV (Abcam, catalogue no. ab19808), anti-Flag (Sigma, catalogue nos 
F3165, F7425), Ubiquitin (Santa Cruz, catalogue no. sc-8017, P4D1), rhodamine 
phalloidin (Invitrogen, catalogue no. R415), PDI (1D3) (Assay Designs, catalogue 
no. SPA-891), anti LC-3 (Sigma, catalogue no. L-7543), anti-alpha tubulin 
(DM1A, Abcam, catalogue no. ab7291), anti-fibronectin (Abcam, ab2413), 
anti-GM130 (BD Biosciences, catalogue no. 610822), and anti-EGFR (Ab12, 
Neomarkers, MS-400P1). LF-67 (anti-sera for Type I procollagen) was obtained 
as a gift from L. Fisher. 

Cell culture. The D3 mouse embryonic stem cells (mouse ES cell) were maintained 
in ES cell medium containing 15% FBS, 1X sodium pyruvate, 1X non-essential 
amino acids, 1 mM f-mercaptoethanol and 1,000 U ml™! leukaemia inhibitory 
factor (Millipore, catalogue no. ESG1107) in GIBCO Dulbecco’s Modified Eagle 
Medium, and grown on 0.1% gelatin-coated tissue culture plates. HeLa cells, 293T 
cells, 3T3 cells and IMR90 cells were maintained in DMEM plus 10% FBS. Dialysed 
FBS was bought from HyClone. The doxycycline-inducible 293T Trex KLHL12- 
3 Flag stable cell line was made with Flp-In T-REx 293 Cell Line system from 
Invitrogen. Stable cell lines expressing other BTB-proteins were generated accord- 
ingly. These cell lines were maintained with 10% TET(—) FBS, blasticidin and 
hydromycin B as instructed and expression was induced by 1 jg ml’ doxycycline. 

Human lung fibroblasts IMR-90 cells were obtained from the Corielle Institute: 
NIA (National Institute on Ageing) Ageing Cell Repository. For generating pro- 
collagen stable HT-1080cell lines, we cloned proalpha(1) into a pRMc/CMV- 
vector and selected for neomycin resistance”. This vector was provided as a gift 
by N. Bulleid. Cells were kept in a 37 °C incubator with 5% CQ). 
siRNA screen in mouse ES cells. siRNA oligonucleotides against 40 mouse 
ubiquitin E3 enzymes were pre-designed by Qiagen and handled as instructed. 
Two different siRNA oligonucleotides against each gene were included in the initial 
screen. 10 pmol of siRNA oligonucleotides and 0.25 il of Lipofectamine2000 were 
pre-incubated in a 0.1% gelatin-coated 96-well plate in 20 ul of OPTIMEM for 
15 min at room temperature. The D3 mouse ES cells were trypsinized and seeded 
at 15,000 cells per well in 80 kl of ES cell medium on top of the siRNA mixture. Fresh 
medium was added to the cells the next day and the morphology of ES cell colonies 
were examined using bright-field microscopy at 48h post transfection. Hit valid- 
ation was performed with additional siRNAs that were purchased from two distinct 
vendors (Qiagen, Dharmacon) and that target different sites of the Cul3 mRNA. 
Knockdown efficiency was tested by qRT-PCR and immunoblot. 

Rescue of Cul3-siRNA phenotype in mouse ES cells by Matrigel and collagen 
IV. D3 mouse ES cells were grown on tissue culture dishes coated with gelatin 
(negative control), growth-factor-depleted Matrigel (BD Biosciences, catalogue 
no. 356231), or purified collagen IV (BD Biosciences, catalogue no. 354233). 
Matrigel and collagen IV were applied at 10 jigcm™*. CUL3 was depleted 24h 
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later using our standard siRNA transfection protocol, and mouse ES cell morpho- 
logy was analysed by confocal microscopy against integrin B1, actin and DNA. 
Drug treatments of CUL3-depleted cells. To study the synthetic lethal effect of 
SRC-inhibition with CUL3 knockdown, we treated wild-type and CUL3-depleted 
D3 mouse ES cells with 0, 25, 50 or 100nM of dasatinib for 18h before the 
phenotypes were analysed by light microscopy. 

To study the effect of RHO-inhibition on CUL3 knockdown, CUL3-depleted D3 

mouse ES cells were treated with ROCK inhibitor Y27632 at 10 uM for 24h before 
phenotype analysis. Alternatively, RHOA was co-depleted using specific siRNAs. 
Cell cycle analysis. To assess the division rate of CUL3-depleted mouse ES cells, 
we treated cells with control, Cul3-, or Ube2C/Ube2S-siRNA and seeded at 
3 X 10° cells per well in gelatin-coated six-well plates. The specificity of Ube2S- 
and Ube2C-siRNAs was tested before*’. The cells were trypsinized at 2, 3 and 
4 days post transfection and counted by haemocytometer. 
ES cell differentiation analysis. To differentiate mouse ES cells into embryoid 
bodies (EB), we trypsinized undifferentiated D3 mouse ES cells, washed once with 
leukaemia inhibitory factor-free ES cell media, and seeded the cells at 2 x 10° cells 
per dish onto 10-cm Corning Ultra-Low- Attachment Dishes (Corning catalogue no. 
3262) containing 10 ml of ES cell medium without leukaemia inhibitory factor. After 
24h, the cells were dissociated from the plate by gentle pipetting of the medium and 
collected in a 15-ml Falcon tube by centrifugation. The supernatant was aspirated off 
and the cells were re-seeded onto 10-cm Corning Ultra-Low-Attachment Dishes 
containing fresh ES cell medium without leukaemia inhibitory factor. Medium was 
changed every other day for a total of 6 or 9 days. Total RNA of ES cells and EB 
samples was extracted using TRIzol (Invitrogen, catalogue no. 15596-026) and 
chloroform. The expression of pluripotent markers and BTB genes at various time 
points during differentiation was analysed using quantitative real-time PCR. 

As a complementary experiment, D3 mouse ES cells were treated with control 
or Oct4 siRNA. 48h after transfection, cells were collected and total RNA was 
extracted using TRIzol as above. The expression of pluripotent markers, tissue 
specific genes and BTB genes in control and OCT4-depleted cells were analysed 
using qRT-PCR. 

Quantitative real-time PCR analysis. We used TRIzol (Invitrogen, catalogue no. 
15596-026) and chloroform to extract total RNA from cells. The first-strand cDNAs 
were synthesized by using RevertAid first strand cDNA synthesis kit (Fermentas, 
catalogue no. K1621). Gene-specific primers for RT-PCR were designed by using 
NCBI Primer-Blast. The quantitative RT-PCR reaction was done with the Maxima 
SYBR Green/Rox qPCR system (Fermentas, catalogue no. K0221). 
Identification of CUL3-KLHL12 substrates. To identify CUL3-KLHL12 sub- 
strates, we generated a doxycycline-inducible human KLHL12-3 x Flag stable cell 
line using the Flp-In T-REx 293 Cell Line system (Invitrogen). As controls, we 
generated stable cell lines expressing other BTB proteins including KLHL9. 
KLHL12-3xFlag and KLHL9-3Flag expression was induced in 30 X 15cm 
plates by 1 pg ml’ of doxycycline for 48h, and cells were collected by centrifu- 
gation and lysed by douncing 40 times in PBS+0.1%NP40. The cell lysate was 
cleared by centrifugation and then subjected to anti-Flag M2 affinity gel (Sigma, 
catalogue no. A2220-5mL) at 4 °C for 4h ona rotator. Immunoprecipitations were 
eluted by 300 pil of 200 pg ml ' 3 Flag peptide (Sigma, catalogue no. F4799-4MG) 
in PBS. The elution was repeated three times for 1 h at room temperature. Eluates 
were pooled, concentrated to 100ul using Amicon Ultra-0.5, Ultracel-10 
Membrane (Millipore, catalogue no. UFC501008) and run on a SDS-PAGE gel. 
The gel was stained by SimplyBlue SafeStain (Invitrogen, catalogue no. LC6060), 
and specific gel bands were cut out and sent for mass spectrometry analysis by the 
Vincent J. Coates Proteomics/Mass Spectrometry Laboratory at UC Berkeley. 
Immunoprecipitation of endogenous protein complexes. To confirm the inter- 
action of endogenous proteins, we lysed HeLa cells or D3 mouse ES cells by 
freeze-thaw twice in 20mM HEPES buffer pH 7.5, 5mM KCl, 1.5mM MgCh, 
1X protease inhibitor cocktail (Roche). Specific antibodies against CUL3, SEC13 
or SEC31 conjugated to protein G agarose beads were added to the cleared cell 
lysate and incubated at 4°C for 4h. Protein complexes were eluted with gel- 
loading buffer at 95°C. Endogenous proteins in complexes were detected by 
immunoblot using specific antibodies against CUL3, SEC13, SEC31 or KLHL12. 

To detect ubiquitylation of endogenous COPII components, we incubated 
HeLa cell extract with pre-immune serum or antibody against SEC13 conjugated 
to protein G agarose beads at 4 °C for 4 h. Protein complexes were eluted with SDS 
gel-loading buffer at 95 °C. Ubiquitylated proteins in the complex were detected 
by immunoblot against ubiquitin. 

In vitro protein interaction assays. To dissect the KLHL12 and SEC31A inter- 
action, we coupled 20 pg recombinant MBP-KLHL12, various mutants or MBP 
as a control to 15 pl amylose resin by incubating at 4°C for 1h. CUL3, SEC31A 
and mutants were expressed from pCS2 and labelled with [*°S]-Met using TnT 
Sp6 Quick Coupled Trsnc/trans Syst (Promega, catalogue no. 12080). The 
labelled CUL3 or SEC31A were incubated with MBP-KLHL12 or mutants at 
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4°C for 3h. Beads were washed four times with TBST and twice with TBS, and 
incubated in SDS loading buffer at 95 °C. Samples were run on SDS-PAGE and 
results were visualized by autoradiography. 

In vitro ubiquitylation assays with CUL3-KLHL12. CUL3/RBX1 was conju- 
gated to NEDD8 at 30°C for 1h with the following conditions: 2.5 mM Tris/HCl 
pH7.5, 5mM NaCl, 1 mM MgCh, 1 mM DTT, 1X energy mix”, 1 uM APPBP1- 
UBA3, 1.2uM UBC12, 44M CUL3/RBX1, and 604M NEDD8. For in vitro 
ubiquitylation of SEC31A, we set up a 101 reaction as follows: 2.5mM Tris/ 
HCl pH 7.5, 5mM NaCl, 1mM MgCl2, 1mM DTT, 1X energy mix, 100nM 
UBAI, 144M UBCHS5C, 1 uM CUL3~NEDD8/RBX1, 1M KLHL12, 150 uM 
ubiquitin, 0.05 ug SEC13/31A. The reaction was carried out at 30°C for 1h 
and stopped by adding SDS gel loading buffer. 

In vivo ubiquitylation assays with CUL3-KLHL12. 293T cells grown in 10-cm 
dishes were transfected with pCS2-HA-Sec13/31A, pCS2-His-ubiquitin, pcDNA5- 
Klhl12-FLAG, pcDNA4-Cul3-FLAG, or pcDNA4-Cul3”°°-FLAG, as indicated, 
using calcium phosphate. 24h later, 1 1M MG132 was added and cells were incu- 
bated overnight. Cells were harvested with gentle scraping and resuspended in 1 ml 
buffer A (6M guanidine chloride, 0.1 M NagHPO,/NaH>PO, and 10 mM imida- 
zole, pH 8.0). Cells were lysed by sonication for 10s and incubated with 25 pl Ni- 
NTA agarose at room temperature for 3 h. The beads were washed twice with buffer 
A, twice with buffer A/TI (1 volume buffer A and 3 volumes buffer TI), once with 
buffer TI (25 mM Tris-Cl, 20 mM imidazole, pH 6.8), and incubated in 60 pl SDS 
gel-loading buffer containing 300 mM imidazole and 50 mM f-mercaptoethanol at 
95°C. Samples were separated by SDS-PAGE and ubiquitylated SEC31A was 
detected by immunoblot using antibody against SEC31A. 

To detect SEC31A ubiquitylation upon CUL3/KLHL12 depletion, we co- 
transfected 100nM siRNAs against CUL3 or KLHL12 with pCS2-HA-Sec13/ 
31A and pCS2-His-ubiquitin using calcium phosphate. The Ni-NTA purification 
was performed 48 h post transfection and SEC31A ubiquitylation was detected as 
described above. 

Confocal microscopy. Cells were fixed in 4% paraformaldehyde and permeabilized 
with 0.5% Triton X-100 in 1X TBS, 2% BSA. Cells were incubated with primary 
antibodies against SEC31A, SEC13, SEC24C, ERGIC53, CD63, BiP (also known as 
HSPAS) or ubiquitin for 2 h and secondary antibodies (Invitrogen, Alexa Fluor 546 
goat anti-rabbit IgG (H+L); Alexa Fluor 488 goat anti-mouse IgG (H+L); 
HOECHST 33342,) for 1h at room temperature followed by extensive washing. 
Pictures were taken on Zeiss LSM 510 and 710 Confocal Microscope systems and 
analysed with LSM image browser and Imaris 3D imaging processing software. 
Transmission electron microscopy. Mock- and KLHL12-transfected HeLa cells 
were grown to 70% confluence as a monolayer on an Aclar sheet (Electron 
Microscopy Sciences). The cells were fixed for 30 min in 0.1 M cacodylate buffer, 
pH 7.2, containing 2% glutaraldehyde, and subsequently washed with buffer before 
post-fixation with 1% osmium tetroxide on ice. This was followed by staining 
with 1% aqueous uranyl acetate for 30 min at room temperature. For dehydration 
with progressive lowering of temperature, each incubation period was 10 min, with 
exposure to 35% ethanol at 4 °C, to 50% ethanol and 70% ethanol at —20 °C, and 
95%, and 100% ethanol at —35°C. Cells were restored to room temperature in 
100% ethanol before flat embedding in an Epon resin. Thin (70-100 nm) sections 
were collected on Formvar-coated 200-mesh copper grids and post-stained with 2% 
aqueous uranyl acetate and 2% tannic acid. The sections were imaged at 120 kV 
using a Tecnai 12 Transmission Electron Microscope (FEI). 

For the purpose of immunolabelling, HeLa cells expressing Flag-KLHL12 or 
doxycycline-inducible 293T Trex Flag-KLHL12 stable cell lines were fixed in 2% 
paraformaldehyde and 0.5% glutaraldehyde and embedded in LR white resin. 
Fixation and infiltration were performed in a microwave oven (Pelco model 3450, 
Ted Pella). 70-nm thick sections were picked on 100-mesh nickel grids coated 
with Formvar film and carbon, incubated in blocking buffer (5% BSA, 0.1% fish 
gelatin, 0.05% Tween 20 in PBS) for 30 min, and followed by incubation with anti- 
Flag antibody at a dilution of 1:40 for 1h. Goat anti-mouse IgG conjugated with 
10-nm gold (BD Biosciences) was used as the secondary antibody at a dilution of 
1:40 for 1h. Sections were post stained in 2% uranyl acetate for 5 min. 

Gene expression analysis by microarray. To compare gene expression profiles of 
wild-type mouse ES cells versus CUL3-depleted mouse ES cells, we transfected D3 
mouse ES cells with control or Cul3-siRNA, followed by growth on gelatin-coated 
six-well plates. 48 h later, total RNA was extracted by TRIzol and chloroform, and 
further purified using RNeasy Mini Kit (Qiagen, catalogue no. 74104). 
Microarray analysis was performed by the Functional Genomics Laboratory 
(UC Berkeley) using Affymetrix Mouse 430A 2.0 chip. 

Analysis of collagen export from cells. IMR-90 human lung fibroblasts grown 
on 100-mm dishes in DMEM/10% FBS were transfected with Flag~-KLHL12, 
Flag~KLHL12(FG289AA), Flag~-KEAP1 and pcDNAS-flag using nucleofection 
kit R (bought from Lonza) as described in the manufacturer’s protocol and plated 
on six-well plate with 25-mm coverslips. When indicated, co-transfections with 


24g each of Flag~KLHL12 and dominant-negative CUL3 were performed. 
Dialysed 10% FBS media was used for ascorbate free transfections. Brefeldin A 
(Sigma) was used at a concentration of 2.5 mg ml * and cells were incubated for 
30 min. MG132 was used at 20 1M for 2h, chloroquine was used at 200 UM for 
1h. Media was collected the next day and cells on coverslips were fixed with 3% 
paraformaldehyde for 30 min and remaining cells on a plate were used to prepare 
lysates. Cells on coverslips were permeabilized with 0.1% Triton for 15 min at 
room temperature followed by blocking with 1%BSA for 30 min. Primary antibodies 
used were polyclonal anti-procollagen (LF-67,diluted 1:1,000) and anti-Flag (diluted 
1:200). Secondary antibodies were Alexa Fluor 546 donkey anti-rabbit IgG and Alexa 
Fluor 488 goat anti-rabbit IgG (diluted 1:200). After staining cells with appropriate 
primary and secondary antibodies, we fixed coverslips on slides using mounting 
reagent containing DAPI. Images were analysed with a Zeiss LSM710 confocal 
microscope and captured with Zen10 software. Merges of images were performed 
with ImageJ and LSM image Browser. Media collected from six-well plates was 
normalized with respect to lysate protein concentration estimated using BCA 
method. Media and lysates of each reaction were checked by immunoblot analysis. 
Tubulin was used as loading control for lysates. Ascorbate chase experiments were 
done by adding ascorbate (0.25 mM ascorbic acid and 1 mM asc-2-phosphate) to 
KLHL12-transfected cells, followed by incubation for 5, 10, 30 and 60 min. 

A human fibrosarcoma cell line (HT1080) stably transfected with proalphal(1) 
was used for CUL3 knockdowns. Cul3- and Kih12-shRNAs targeting two different 
regions in both genes were cloned into pSuperGFP and transfected using 
Lipofectamine 2000. pSuper GFP was used as negative control. Cells were grown 
on 25-mm coverslips in six-well plates and fixed 2 days post transfection. Collagen 
staining was done using LF-67 (1:1,000) and endoplasmic reticulum was stained with 
anti-PDI (1:1,000) antibody. Fibronectin and EGER were stained in parallel experi- 
ments. Fibronectin expression was induced in HT 1080 using 1 uM dexamethasone 
before CUL3 knockdowns. Endoplasmic reticulum retention or secretion was scored 
in cells expressing GFP shRNAs. Cells without GFP shRNAs and transfected with 
pSUPER GFP were quantified as well. Images were taken on a Zeiss LSM 710 
confocal microscope and visualized with LSM image browser. Lysates were prepared 
from remaining cells on six-well plates and checked for knockdown efficiency. 
siRNA oligonucleotides used in this study. RNA interference oligonucleotides: 
mCul3 #1, GAAGGAATGTTTAGGGATA; mCul3 #2, GGAAGAAGATGCAG 
CACAA; mCul3 #3, GGTGATGATTAGAGACATA; mCul3 #4, CAACTTTCT 
TCAAACACTA; mCul3 #5, CATTATTTATTGATGATAA; mUBA3, CGTTTG 
AAGCAGAGAGAAA; mklhl12, CCTTGAGAGTGGAGCAGAA; hkthl12, 
CCAAAGACATAATGACAAA; mKBTBD8, GAACATGAGCAGAGTGAAA; 
mOct4, AGGCAAGGGAGGTAGACAA; hSec31, CCTGAAGTATTCTGAT 
AAA; mSecl13 (pool of 4 oligonucleotides), CCATGTGTTTAGTAATTTA, 
GGCAATATGTGGTCACCTA, GCTGAAAGTATTCATGTAA and GGAAC 
AAATGACTATTATT; mCdc42 (pool of 4 oligonucleotides), GATCTAATT 
TGAAATATTA, GGATTGAGTTCCTAATTAA, AGAGGATTATGACAGAC 
TA and AAATCAAACTAAAGATTAA; mBcar1/CAS (pool of 4 oligonucleotides), 
GACTAATAGTCTACATTTA, GGAGGTGTCTCGTCCAATA, CTATGACA 
ATGTTGCTGAA and GGGCGTCCATGCTCCGGTA; mSrc (pool of 4 oligonu- 
cleotides), CCCTTGTGTCCATATTTAA, CCACGAGGGTTGCCATCAA, CA 
GACTTGTTGTACATATT and GCAACAAGAGCAAGCCCAA; mRhoG (pool 
of 4 oligonucleotides), GGTTTACCTAAGAGGCCAA, GCTGTGCCTTAAG 
GACTAA, GCACAATGCAGAGCATCAA and GGCGCACCGTGAACCTA 
AA; mRhoA (pool of 4 oligonucleotides), GGATTTCCTAATACTGATA, 
GAAAGTGTATTTGGAAATA, AGCCCTATATATCATTCTA, CGTCTGCCA 
TGATTGGTTA; mRacl (pool of 4 oligonucleotides), GGTTAATTTCTGTCA 
AACA, GCGTTGAGTCCATATTTAA, GCTTGATCTTAGGGATGAT and 
GGAGTAATTCAACTGAATA; mCdh1/E-cadherin (pool of 4 oligonucleotides), 
GGAGGAGAACGGTGGTCAA, CGCGGATAACCAGAACAAA, CCATGTTT 
GCTGTATTCTA and GGGACAATGTGTATTACTA; mlqgap1 (pool of 4 
oligonucleotides), ACATGATGATGATAAACAA, GGTTGATTTCACAGAAGAA, 
GTATAAATTTATTTCITAA and GGTGGATCAGATTCAAGAA; mCull 
(pool of 2 oligonucleotides), GCATGATCTCCAAGTTAAA and CGTGTAATC 
TGCTATGAAA; mCul2 (pool of 2 oligonucleotides), GCGCTGATTTGAAC 
AATAA and CCAGAGTATTTATATCTAA; mCul4a (pool of 2 oligonucleotides), 
GTGTGATTACCATAATAAA and CCAGGAAGCTGGTCATCAA; mCul5 
(pool of 2 oligonucleotides), CCCTCATATTTACAGCAAA and ACATGAAGTT 
TATAATGAA; mCul7 (pool of 2 oligonucleotides), GCATCAAGTCCGTTAA 
TAA and GGATGTGATTGATATTGAA. 
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Ribosome-driven protein biosynthesis is comprised of four phases: initiation, elongation, termination and recycling. In 
bacteria, ribosome recycling requires ribosome recycling factor and elongation factor G, and several structures of 
bacterial recycling complexes have been determined. In the eukaryotic and archaeal kingdoms, however, recycling 
involves the ABC-type ATPase ABCE] and little is known about its structural basis. Here we present cryo- 
electron microscopy reconstructions of eukaryotic and archaeal ribosome recycling complexes containing ABCE] and 
the termination factor paralogue Pelota. These structures reveal the overall binding mode of ABCEI to be similar to 
canonical translation factors. Moreover, the iron-sulphur cluster domain of ABCE] interacts with and stabilizes Pelota in 
a conformation that reaches towards the peptidyl transferase centre, thus explaining how ABCE1 may stimulate 
peptide-release activity of canonical termination factors. Using the mechanochemical properties of ABCE1, a 
conserved mechanism in archaea and eukaryotes is suggested that couples translation termination to recycling, and 


eventually to re-initiation. 


Recycling of ribosomes for a new round of translation initiation is an 
essential part of protein synthesis. In archaea and eukaryotes recycling 
has been shown to require the highly conserved and essential ABC- 
type ATPase ABCE1 (Rlilp in Saccharomyces cerevisiae with 46.7% 
identity to archaeal (a)ABCE1 in Pyrococcus furiosus)'*. ABCE1 can 
dissociate ribosomes into subunits either after canonical termination 
by release factors* or after recognition of stalled ribosomes by messenger 
RNA surveillance factors such as Pelota (Dom34p in S. cerevisiae, 
aPelota in P. furiosus)°. Crystal structures of aABCE1 revealed two 
nucleotide-binding domains (NBDs) in a typical head-to-tail orienta- 
tion as observed for most of the other members of the ABC protein 
family® *. Additional unique structural features of ABCE1 proteins are 
a helix-loop-helix (HLH) motif, a highly conserved hinge domain 
and an iron-sulphur cluster domain (FeS) containing two [4Fe-4S]** 
clusters®?. 

In eukaryotes ABCE1 can be found associated with ribosomes and 
small ribosomal subunits, but also with release factors and initiation 
factors (eRF1, eIF2, eIF3 and eIF5)'®"'. Notably, ABCE1 physically 
interacts with eRF1 and directly influences its function in stop-codon 
recognition and peptidyl-transfer RNA (tRNA) hydrolysis’*"*. During 
recycling ABCE1 can split post-termination complexes obtained with 
eRF1 and eRF3 into free 60S subunits and tRNA- and mRNA-bound 
40S subunits*. A similar role for ABCE1 was found in an archaeal 
translation system in which aABCE] together with aRF1 was shown 
to dissociate ribosomes into subunits upon ATP binding®. 

ABCE1 also acts together with the eRF1 paralogue Pelota®. In S. 
cerevisiae, Dom34 and the eRF3 paralogue Hbs1 were described as 
mRNA surveillance factors recognizing stalled elongating ribosomes”. 


Such stalls can occur on mRNAs with stable secondary structures, 
truncations or lacking a stop codon, so that further elongation or 
canonical termination is prevented. In the so called no-go mRNA 
decay (NGD) or non-stop mRNA decay (NSD) pathways, such stalled 
ribosomes are recognized by Dom34 and Hbsl (NGD and NSD, 
respectively)'* or by another eRF3 paralogue Ski7 (NSD), eventually 
triggering mRNA degradation'*"’. A cryo-electron microscopy (cryo- 
EM) structure ofa stalled ribosome bound to Dom34—Hbs1 shows that 
Dom34 occupies the ribosomal A site, whereas Hbs1 binds the ribosome 
similar to other translational GTPases, such as elongation factor Tu (EF- 
Tu)’. Dom34-Hbs1 alone shows ribosome dissociation activity and 
splits stalled reconstituted ribosomes that contain P-site peptidyl- 
tRNA”. In a mammalian system, however, ABCE1 is strictly required 
for ribosome disassembly of both programmed and vacant ribosomes 
by Pelota and Hbs15 (ref. 5). Taken together, ABCE1 is probably the 
general ribosome recycling factor in archaea and eukaryotes. In contrast 
to the analogous bacterial system, however, ABCE] acts not only after 
canonical release-factor-dependent termination but also after Pelota- 
dependent recognition of stalled ribosomes. 

It is not known how ABCE1 functions on the ribosome in concert 
with Pelota or release factors, and how the mechanochemical properties 
of ABCE] are used for ribosome recycling. To address these questions, 
we determined cryo-EM structures of eukaryotic and archaeal recycling 
complexes containing Pelota and ABCE1. 


Model of Pelota-ABCE1-ribosome complexes 


Recycling complexes were obtained by in vitro reconstitution of the 
70S and 80S ribosomes with purified Pelota and ABCE1 orthologues. 
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For the generation of a S. cerevisiae 80S ribosome-Dom34-Rlil 
complex we used ribosome nascent chain complexes (RNCs) stalled 
by an mRNA with a synthetic stem loop (SL)'*', a complex used 
previously for an 80S-Dom34-Hbs1 cryo-EM reconstruction’. For 
archaeal (P. furiosus) 70S—aPelota-aABCE1 complexes, 70S ribosomes 
were purified from a translation extract”. Simultaneous ribosome 
binding of Pelota and ABCEI in the presence of non-hydrolysable 
ADPNP was shown by pelleting assays in the yeast and archaeal 
systems (Supplementary Fig. la, b). Notably, aABCE1-dependent 
splitting of archaeal ribosomes was not detectable with ADPNP, but 
strictly required ATP (Supplementary Fig. 1c, d). 

Using cryo-EM in combination with single-particle analysis, we 
determined the structures of the SL-RNC-Dom34-Rlil complex 
from yeast and the 70S-aPelota-aABCE1 complex from P. furiosus. 
Computational sorting was performed to generate homogeneous 
populations of ribosomal complexes containing Pelota, ABCE1 and 
P-site tRNA. The resolution of the final maps was determined to be 
7.2A for the yeast complex and 6.6A for the archaeal complex 
(Supplementary Fig. 2). In both reconstructions we observed density 
for Pelota in the ribosomal A site, for ABCE1 in the GTPase trans- 
lation factor binding site, and for tRNA in the P site (Fig. la, b). 
Additional E-site tRNA density is present in the archaeal ribosome. 
Both reconstructions are remarkably similar with respect to con- 
formation and the ribosomal interaction patterns of the ABCE1 and 
Pelota orthologues. Using available crystal structures we could 
unambiguously assign and position the individual domains of 
Pelota—divided into amino-terminal domain (NTD), central domain 
and carboxy-terminal domain (CTD)—and ABCE1—divided into the 
N-terminal FeS, NBD1 containing a HLH motif, NBD2 and the hinge 
domain (Fig. 1c and Supplementary Fig. 3). 

Notably, the two electron dense [4Fe-4S]** clusters of ABCE1 can 
be clearly resolved as distinct spheres at high contour levels in both the 
yeast and archaeal maps reconstructions, validating the positioning of 
crystal structures in the EM maps (Fig. 1d). For molecular analysis we 
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used the crystal structure of the yeast 40S subunit”*, the model of the 
yeast 80S ribosome**”’ and, in addition, we built a homology-based 
molecular model of the archaeal 70S ribosome. 


Ribosome-ABCE1 interaction 
ABCEI binds to ribosomes in the intersubunit space, where canonical 
translational GTPases such as EF-Tu, EF-G/eEF2 or Hbs! also inter- 
act with the ribosome (Fig. 2a)'°***?. ABCE1 excludes these factors 
from binding at the same time, and we thus conclude that dissociation 
of Hbs1 or aEF1« or, in the case of termination, eRF3 or aEF14, has to 
precede ABCE1 binding. Similar to these GTPases, the ATPase 
ABCEI contacts the small ribosomal subunit, specifically ribosomal 
RNA helices h5, h8, h14 and h15 (Supplementary Tables 1 and 2). The 
h5-h15 region interacts with domain II of the translational GTPases, 
whereas the h8-h14 junction is the most proximal region to the 
GTPase switch regions*®*' (Supplementary Fig. 4). Interestingly, the 
same regions are contacted by ABCE1 via two specific, up to now 
unexplained, structural features of ABCE1-type ABC-ATPases. The 
HLH motif of ABCE1 contacts the h5-h15 junction, whereas the 
hinge region establishes extensive contacts with the h8-h14 junction. 
In contrast to translational GTPases that engage in close interaction 
with the sarcin-ricin loop (SRL) of the rRNA helix, H95, contacts of 
ABCE1 with the large subunit are essentially limited to L9 in both 
species. Despite the overall marked similarity between Rlil and 
aABCE1 in their ribosome interaction mode, additional minor 
contacts are present in the yeast complex: Rlil contacts rpS6e and 
rpS24e on the small subunit, and, on the large subunit, rpP0 and a 
small region of the SRL (H95), which is different from the binding 
region of translational GTPases (Fig. 2b, c). Unexpectedly, the FeS 
cluster domain of ABCE1 does not directly bind the ribosome but 
instead interacts with Pelota only. These interactions are conserved 
between yeast and archaea. 

In summary, ABCE1 establishes multiple contacts with both small 
and large ribosomal subunits as well as with the release factors (and 
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Figure 1 | The ribosome-bound Pelota-ABCE1 complex. a, b, Cryo-EM 
reconstructions of the eukaryotic SL-RNC-Dom34-Rlil and the archaeal 70S- 
aPelota~aABCE1 complexes at 7.2 A and 6.6 A resolution, respectively. Extra 
densities were observed for Dom34/aPelota and Rlil/aABCE] in the canonical 
factor binding site as well as for P-site tRNA, E-site tRNA and mRNA. The top 
section represents side views, the bottom section top views, where large and 
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small subunits were cut. c, Homology model for ribosome-bound Pelota and 
ABCE1 in transparent density. The individual domains are colour-coded as in 
the schematic representation of domain organization. The NTD, central 
domain (ce) and CTD are indicated. H1 and H2 indicate hinge 1 and hinge 2 
domains. d, Zoom on the FeS domain of aABCE1. The density for the two [4Fe-— 
4S]°* clusters is displayed in red mesh at high contour level. 
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Figure 2 | Interaction of Pelota and ABCE1 with the ribosome. 

a, Comparison of the SL-RNC-Dom34-Rlil cryo-EM map with the SL-RNC- 
Dom34-Hbs1 and the 80S-eEF2 maps. Views are as in Fig. la, b. 

b, c, Interactions of ABCE1 with the eukaryotic (b) and the archaeal 

(c) ribosome. The view is indicated by a thumbnail. The domain colour code is 
as in Fig. 1c. 


their paralogues) and these interactions involve all domains of 
ABCE1. Such tight recognition provides a rationale for direct 
mechanochemical coupling of ATP-driven conformational changes 
in ABCE1 with structural changes in the ribosome that are critical for 
termination and release. 


ABCE]-stabilized conformational switch of Pelota 

The FeS domain of ABCE] binds to the CTD of Pelota and we observe 
a large-scale conformational change in the central domain and CTD 
compared to the Dom34 structure in the Hbs1-bound state’? (Fig. 3). 
By contrast, the NTD of Pelota is essentially unchanged in these two 
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structures where they are located in the A site contacting rRNA helices 
h18, h28, h31, h34 and h44, and additionally the ribosomal protein 
rpS30 in yeast. The 83-84 loop reaches deeply into the A site and at 
the given resolution we observed additional contacts of this loop of 
aPelota with the ribosomal protein S5 (Supplementary Fig. 3c). The 
most marked rearrangements, however, occur in the central domain 
of both Pelota orthologues: the central domain of Dom34 bound to 
Hbs]1 in the yeast ribosome is tightly packed against Hbs1 (ref. 19), 
very similar to the domain arrangement in the crystal structure of an 
aPelota-aEFla complex” (Fig. 3b); in the presence of ABCE1, 
however, the central domain of Dom34 or aPelota is rotated by 
approximately 140° towards the P-site tRNA (Supplementary 
Movies 1 and 2). In this conformation it establishes numerous new 
contacts to rRNA in domain IV and V of the large subunit (Sup- 
plementary Tables 1 and 2). The positively charged loop B10-a3 
directly contacts the P-site tRNA acceptor stem and the ribosomal 
protein L10e in archaea. In the closely related RF1 proteins, this loop 
contains the GGQ motif that is essential for catalysing the hydrolysis 
of peptide from peptidyl-tRNA***. When modelling ribosome- 
bound eRF1 on the basis of the Pelota conformation observed in 
the presence of ABCE1, the GGQ motif of eRF1 would be ideally 
positioned to interact with the CCA-end of the P-site peptidyl- 
tRNA in the peptidyl transferase centre (Fig. 3c). This may explain 
how ABCEI can stimulate termination activity in vivo and in vitro’. 

As the CTD of Pelota establishes the only contact site with ABCE1 
via the FeS cluster domain, the interaction surface is rather small 
(440 A?) compared to that between ribosome-bound Dom34 and 
Hbs1 trapped in the GTP state (1,940 A”). In the ABCE1-bound 
conformation, the CTD is rotated downwards by approximately 15° 
together with a movement of the ribosomal stalk base (H43-H44, 
rpL12), similar to that induced by eEF2 binding (Supplementary 
Fig. 5)?*?°. Both the stalk base and the CTD of Dom34 move closer 
towards the SRL of H95 and a strong contact between the CTD of 
Dom34 (helix 7) and the SRL is established (Fig. 3d). Very similar 
conformations of the stalk base, as well as of the central domain and 
CTD, were observed for aPelota on the archaeal ribosome, although 
some molecular details of domain fold and ribosome interaction 
pattern also differ between Dom34 and aPelota. Interestingly, helices 
a5, 06 and «7, which link the central domain and the CTD of aPelota, 
establish one long o-helix with a kink between «5 and «6 in the 
presence of aABCE] that reaches from the SRL deeply into the A site 
(Fig. 3e). 

In summary, in both species the presence of ABCE1 stabilizes an 
alternative conformation of Pelota on the ribosome, primarily affect- 
ing the central domain that reaches through the A site to contact the 
P-site tRNA. An analogous behaviour of the closely related release 
factors would ideally position the conserved GGQ loop for catalysing 
peptidyl-tRNA hydrolysis. 


Mechanochemical activity of ABCE1 on the ribosome 
Typically, ABC proteins generate mechanochemical work by nucleotide- 
driven clamp-like motions of the two NBDs: in the apo or ADP-bound 
state, NBDs adopt an open conformation often linked to a higher 
affinity for the given substrate of the ABC enzyme. ATP binding triggers 
a closed state, by binding to Walker A and B motifs of one NBD and the 
opposing conserved LSGGQ loop (signature motif) of the other NBD 
that coordinates the y-phosphate of ATP for subsequent hydrolysis”. 
ATP binding or subsequent ATP hydrolysis leads to a “power stroke’ 
that usually causes concomitant conformational changes in connected 
domains or binding partners. 

To analyse the mechanochemical function of ABCE1 in ribosome 
splitting, we compared the ribosome-bound conformation of ABCE1 
with the open ADP-bound form as observed in the crystal®* and witha 
model for the closed ATP-bound state (Fig. 4a). The model for the 
closed state is derived by individually superimposing the NBDs of 
ABCEI to NBDs crystallized in the ATP-bound state’®. Interestingly, 
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Figure 3 | Domain movements in Pelota and eRF1. a, Comparison of the 
ribosome-bound Dom34 conformation in complex with Hbs1 (top) and Rlil 
(bottom). b, Comparison of the aPelota-aEF1o crystal structure*” with the 
ribosome-bound aPelota in complex with aABCE1. The central domain (ce) of 
Pelota swings out towards the P-site tRNA. The inset shows a thumbnail 
indicating the view. c, Models for eRF1 before and after the suggested 


neither the open nor the closed model can be easily modelled into the 
electron density in the reconstructions. In both reconstructions, we 
observe an intermediate, half-open state of the two NBDs: NBD2 
rotates by approximately 17° towards NBD1 and the FeS cluster 
domain. However, an additional upward movement by 8 A of NBD2 
would be required to obtain the fully closed conformation, in which the 
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movement of the central domain. The arrows indicate the movement of the 
central domains. d, Conformation of the Dom34 CTD and the stalk base rRNA 
(H43-H44) when bound to Hbs!1 (yellow) and to Ril (blue). rRNA 
conformation without factors bound is shown in grey. e, In aPelota three 
separate small helices refold into a long «-helix during movement of the central 
domain bridging the CTD and the central domain. 


signature motif of one NBD domain contacts the nucleotide-binding 
pocket of the other NBD domain (Supplementary Movie 3). Notably, 
in the observed half-open state of ABCE1 we find a contact between the 
NBD2 domain and the FeS cluster domain that is not seen in the crystal 
structures of the open state. Adoption of the fully closed ATP-bound 
conformation would therefore require a substantial shift of the FeS 
cluster domain, also of about 8A (Fig. 4a), to avoid a steric clash. 
Although the limited resolution of the reconstruction does not allow 
for any conclusions regarding the nature of the bound nucleotide, the 
conformations of the individual lobes within the NBD1 and NBD2 
domains more closely resemble those of the ADP-bound crystal struc- 
tures. The similarity of the ‘intermediate’ conformation in both recon- 
structions suggests that binding to the ribosome induces an allosteric 
change in ABCE1, perhaps related to allosteric control of the ABC 
transporter by substrate binding”. 

The finding that ATP hydrolysis*’ is required for full splitting 
activity strongly suggests that ABCE1 indeed has to undergo a con- 
version from the observed half-open pre-splitting conformation to 
the fully closed ATP state to efficiently dissociate ribosomes. 
Therefore, we analysed the effect of ATP-dependent NBD domain 
closure by superimposing the half-open ribosome-bound state of 
ABCE1 with the model for the closed state. ABCE] in the closed 
conformation would not sterically clash with the ribosomal subunits 


Figure 4 | Mechanochemical activity of ABCE1 on the ribosome. a, Crystal 
structure of the open (ADP-bound) aABCE1, the cryo-EM structure of the 
ribosome-bound aABCE]1 and homology model of the closed (ATP-bound) 
aABCE1 including schematic drawings. An asterisk indicates a contact between 
NBD2 and the FeS domain of aABCE1. b, Ribosomal subunits may be 
dissociated by following the trajectory of aABCE1 domain closure upon ATP 
binding. c, Interactions of the aPelota NTD and central domain within the 
ribosome. d, ABCE1 domain closure could lead to an allosteric cascade with the 
FeS domain acting as a bolt on the CTD of Pelota to rearrange the NTD and 
central domain of Pelota. This mechanism would be analogous to the splitting 
reaction in bacteria by RRF and EF-G as depicted in e. 


to induce splitting. One possibility is that the small and large 
ribosomal subunits follow the trajectory of NBD1 and NBD2 of ABCE1, 
respectively. In this case the ribosomal subunits would sufficiently 
rotate away from each other so as to affect the intersubunit bridges 
and, thus, the overall ribosome stability (Fig. 4b). 

It is more likely, however, that the transition of ABCE1 through the 
closed conformation triggers an allosteric cascade affecting Pelota: the 
FeS cluster domain of ABCE1 contacts the NBD2 domain already in 
the half-open state and has to follow the movement of the NBD2 
during closure. This conformational change of the FeS cluster domain 
towards the intersubunit space is likely to be transmitted to Pelota via 
the close interaction with its CTD. A shift of the CTD would in turn be 
transmitted to both the NTD and the central domain of Pelota. These 
Pelota domains establish a network of contacts with the small and the 
large ribosomal subunit as well as with the P-site tRNA (Fig. 4c). 
Indeed, numerous mutations underline the functional importance of 
these domains for the activity of Pelota (Supplementary Tables 1 and 
2). A conformational shift can be easily envisaged to cause dissociation 
of the ribosome by destabilizing intersubunit bridges and the P-site 
tRNA. A function of the FeS cluster domain of ABCE] as a structural 
bolt to remodel Pelota by transmitting ATP-induced changes from the 
NBDs is in good agreement with the finding that deletion of this 
domain abolishes splitting activity*. The enhanced stability of the 
domain provided by the FeS cluster may be required in the transmis- 
sion of the mechanochemical power of ABCE1 for ribosome splitting. 

Although using an entirely different cast of characters, this scenario 
is structurally reminiscent of bacterial ribosome recycling by ribosome 
recycling factor (RRF) and elongation factor G (EF-G). In this case, an 
EF-G-based GTP-dependent conformational switch positions RRF to 
clash with the small ribosomal subunit, inevitably promoting subunit 
dissociation® (Fig. 4d, e). 


Conclusion 

We provide a structural basis and a universal mechanistic model for 
eukaryotic and archaeal ribosome recycling in which ABCE] actively 
coordinates rescue (or translation termination) with recycling and re- 
initiation (Fig. 5): 


Recognition 

ene vare) WD Hbs1/aEF1a 

(eRF1/aRF1) (eRF3/aEF1 «) 
GDP + Pi ey 
Stalled ; ills r. = 
(Pre-termination) ® 
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Reinitiation <—— Splitting 

Figure 5 | Scheme of archaeal and eukaryotic ribosome recycling bridging 
termination with initiation. A translational GTPase (Hbs1/aEFla/eRF3) 
delivers the factor, which recognizes stalled ribosomes (Pelota) or pre- 
termination complexes (eRF1/aRF1). After GTP hydrolysis, the GTPase 
dissociates and ABCE1 can bind. ABCE1 induces or stabilizes the swung-out 
conformation of Pelota (or RF1), which would lead to peptide release in case of 
termination. Ribosome splitting is induced after ATP binding to ABCE1 and 
hydrolysis. In eukaryotes, initiation factors can bind during the splitting 
reaction, coupling ribosome recycling with re-initiation. After splitting ABCE1 
stays associated with the small ribosomal subunit. 
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In the first stage, the recognition stage, the sensing factors Pelota (for 
rescue) or RF1 (for termination) are delivered to stalled ribosomes or 
pre-termination complexes by EF-Tu-like GTPases. In the next step, 
the GTPase dissociates to allow ABCE1 binding to the ribosome. 
ABCE1 interacts with the CTD of Pelota (or of RF1 in termination) 
to stabilize the extended conformation of the central domain. In the 
case of translation termination, the GGQ motif of RF1 will be posi- 
tioned proximal to the CCA-end of the P-site tRNA to catalyse peptide 
release; in the case of ribosome rescue, the central domain will be 
tightly accommodated proximal to the peptidyl transferase centre. 
Subsequently, in both cases, ABCE1 triggers ribosome disassembly 
into subunits by a power stroke upon NBD domain closure and ATP 
hydrolysis’. Our biochemical and structural data suggest a universal 
role of ATP hydrolysis in the mechanism of ABCE1-driven recycling. 
The conformational switch of ABCE1 could cause either a direct dis- 
ruption of the ribosomal intersubunit bridges or, more likely, further 
conformational changes via an allosteric cascade from the FeS cluster 
domain of ABCE] to the central domain and NTD of Pelota. In the 
archaeal system ABCE1 remains bound to the small ribosomal subunit 
after splitting* and it has been also found on the small subunit in 
eukaryotes'*"’. Notably, ribosome recycling is coupled in eukaryotes 
with re-initiation when initiation factors such as eIlF3, eIFl and eIF1A 
bind the small ribosomal subunit as recycling is completed**. An 
initial recruitment of eIF3 to the 80S ribosome may even occur directly 
via ABCE1 interaction with the eIF3 subunit eIF3j (Hcrlp in yeast), 
even before recycling is completed'**’”°. In contrast, the analogous 
bacterial recycling system consisting of RRF and EF-G acts only after 
termination is completed and the participation of initiation factors is 
less clear*’. 

In conclusion, the archaeal and eukaryotic kingdoms have maintained 
an extremely conserved general ribosome recycling system with an ABC- 
type ATPase at the core: the mechanochemical properties of ABCE1 are 
used through a still somewhat enigmatic FeS cluster domain. This 
domain triggers an allosteric cascade that actively coordinates translation 
termination or rescue with recycling’’, and eventually with re-initiation. 
It remains a puzzle as to why a complex FeS cluster domain is apparently 
used for a structural role only and has not been replaced by a simpler 
structure over billions of years of evolution. Thus, it is highly desirable to 
seek deeper insight into additional functions of ABCE] in processes such 
as translation initiation and ribosome assembly. 


METHODS SUMMARY 


Programmed yeast SL-RNCs were prepared from cell-free extracts as 
described'*’. Archaeal ribosomes were purified from cell-free extracts” by sucrose 
density centrifugation. Ribosome binding partners (Dom34, aPelota, ABCE1, 
aRF1 and alF6) were expressed in E. coli or S. cerevisiae (Rlil) and purified using 
affinity chromatography. Ligands were reconstituted in vitro with SL-RNCs or 70S 
ribosomes, and binding was analysed by SDS-PAGE after pelleting of ribosome- 
bound fractions. Splitting activity was monitored in sucrose gradients using 
ultraviolet profiles. For cryo-EM, yeast and archaeal recycling complexes were 
vitrified and data were collected on a Titan Krios electron microscope (FEI 
Company). Single-particle analysis and three-dimensional reconstruction was 
done using the SPIDER software package*’. Homology models were generated 
using HHPRED“ and MODELLER”*. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 

Purification of SL-stalled RNCs. Yeast (S. cerevisiae) ribosomes were stalled 
using a synthetic SL 3’ of the coding region (sequence: 5'-GATATCCCGTG 
GAGGGGCGCGTGGTGGCGGCTGCAGCCGCCACCACGCGCCCCTCCAC 
GGGATATC-3’) as described before'*!*!. The mRNA coded for the 120 
N-terminal residues of DBAP-B with additional N-terminal haemagglutinin 
(HA) and Hisg tags. RNC complexes were purified after in vitro translation in a 
yeast cell-free translation extract as described previously’. 

Purification of ribosomes from P. furiosus or T. kodakarensis. Ribosomes were 
purified from frozen P. furiosus or T. kodakarensis cell pellets. Cell pellets were 
suspended in 1.3X $30 buffer (10mM Tris pH 7.4, 60mM potassium acetate 
(KOAc), 14mM MgCl,) overnight at 4 °C. The homogenous lysate was disrupted 
with a microfluidizer (Microfluidics). Cell debris was removed by centrifugation at 
20,000g at 4°C. The supernatant was decanted and ribosomes were pelleted 
through a high-salt sucrose cushion (1 M sucrose, 500 mM NH,OAc, $30 buffer) 
at 312,000g (RP80AT, Sorvall) for 60 min. The ribosomal pellet was suspended in 
buffer TrB25 (56mM Tris pH 8.2, 250mM KOAc, 80mM NH,OAc, 25mM 
MgCl, 1 mM dithiothreitol (DTT)). The ribosomes were then gradient purified 
(10-40% sucrose, 10 mM Tris pH 7.4, 60 mM KOAc, 14mM MgCl,) at 45,600g 
(SW40, Beckman Coulter) for 16h at 4°C. The fractions were collected using a 
Gradient Station (Biocomp) with an Econo UV Monitor (Biorad) and a FC203B 
Fraction Collector (Gilson). The fractions containing 70S ribosomes were washed 
with $30 buffer and concentrated using a 100 kDa Amicon Ultra Centrifugal Filter 
Unit (Millipore). S30 and TrB25 buffer are based on published protocols***’. 
TrB25 is derived from the translation buffer and was modified for our purpose. 
Purification of aABCE1/Rlilp, aPelota/Dom34p, aRF1 and alF6. For, 
Rlilp, RLI1 was cloned from yeast (S. cerevisiae) genomic DNA into pYES2 
and induced in INVScl cells (Invitrogen) at 30 °C for 16h. Cells were harvested 
and suspended in Ni-NTA lysis buffer (75 mM HEPES pH 8.0, 300 mM NaCl, 
5 mM f-mercaptoethanol, 1% Tween, 20 mM imidazole, 10% glycerol), frozen in 
pellets and lysed in a liquid nitrogen Freezer/Mill (SPEX SamplePrep, LLC). 
Lysate was clarified and purified over a HisTrap FF column (GE Healthcare) 
onan ATKA FPLC (GE Healthcare). Additional purification was conducted over 
an $100 size exclusion column (GE Healthcare) pre-equilibrated in Buffer SE 
(20 mM Tris-Cl pH 7.5, 200 mM NaCl, 5mM f-mercaptoethanol, 5% glycerol). 
Purified protein was observed to have a brown/yellow colour. 

For Dom34p, N-terminally tagged S. cerevisiae protein Dom34p was over- 
expressed in E. coli in a pET21a(+) vector and purified via a Ni-NTA affinity 
chromatography as described before**””. 

For aABCEI, C-terminally strep-tagged aABCE1 from P. furiosus was 
expressed and reconstituted as described previously®. Aliquots were kept under 
anaerobic conditions and stored at —80 °C. 

For aPelota, aPelota from T. kodakarensis genomic DNA was cloned into 
pET28 (Novagen) generating a C-terminally His-tagged construct. The E. coli 
strain Rosetta(DE3) (Novagen) was used for expression at 37 °C for 2-3 h. Cells 
were harvested and suspended in buffer A (10 mM Tris pH 8.0, 500 mM NaCl, 
1mM DTT) with 1x Complete EDTA-free Protease Inhibitor cocktail (Roche), 
2mM PMSE and 6 gm! * DNasel. Lysis was achieved using a microfluidizer 
(Microfluidics) at 120.66 MPa. After centrifugation the supernatant was purified 
over a HisTrap HP column (GE Healthcare) on an AKTA FPLC Purifier (GE 
Healthcare). 

For aRF1, aRF1 from T. kodakarensis was prepared as described above for 
aPelota. After elution from the HisTrap HP column, the protein was dialysed 
over night against a 1,000X excess of buffer A. After a heat denaturation step at 
55°C for 10 min, 1% glycerol was added before concentration. Precipitate was 
removed by centrifugation. 

For alF6, alF6 from T. kodakarensis was prepared as described above for 
aPelota. 

Reconstitution of yeast RNC-Dom34-Rlil complexes. For in vitro binding 
assays and cryo-EM, 2 pmol of yeast SL-RNCs were reconstituted with a 10-fold 
molar excess of Dom34p and Rlilp in a volume of 25 ul under final conditions of 
20 mM Tris/HCl pH 7.0, 150 mM KOAc, 10 mM Mg(OAc)s, 1.5 mM DTT, 0.005% 
Nikkol, 10 pg mg | cycloheximide, 0.3% (w/v) digitonin, 500 4M ADPNP and 
incubated for 15 min at 25°C and 10min on ice. To assess ligand binding to 
ribosomes, reactions were applied to a 750mM sucrose cushion and spun for 
2.5h at 152,000g. at 4°C in a SW55 rotor (Beckman Coulter). Supernatant 
and pellet fractions were analysed by SDS-PAGE followed by SYPRO Orange 
(Bio-Rad) staining. Stained proteins were visualized on a phosphorimaging screen 
(Typhoon 9400, GE Healthcare). 

Reconstitution of archaeal 70S-aPelota-aABCE1 complexes. Archaeal com- 
plexes for cryo-EM were reconstituted under anaerobic conditions (glove box, 
Coy Laboratories) in degassed buffer TrB50 (56 mM Tris pH 8.2, 250 mM KOAc, 
80 mM NH,OAc, 50mM MgCl, 1mM DTT) with 2mM ADPNP. 8.5 pmol of 
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ribosomes (P. furiosus), 40pmol of aPelota and 40 pmol of aABCE1 were 
incubated for 25min at 30°C. The complexes were then diluted to 4.0 
OD 60 nm for cryo-EM and kept at room temperature (23 °C) until vitrification. 

Ligand binding assays were performed as described earlier for the archaeal 
cryo-EM complexes. After incubation the reactions were applied to a 1 M sucrose 
cushion and spun for 45 min at 189,000g in a TLA100 rotor (Beckman Coulter). 
Supernatant and pellet fractions were analysed as described above. 

Splitting of archaeal 70S ribosomes. 15 pmol of ribosomes (T. kodakarensis) 
were incubated with a 2.5-fold molar excess of ligands under anaerobic conditions 
in buffer TrB25. ATP was added toa final concentration of 2 mM. The reaction was 
incubated for 25 min at 25 °C. Splitting of the ribosomes was evaluated by sepa- 
ration on a 15-40% sucrose gradient (56 mM Tris pH 8.2, 250 mM KOAc, 80 mM 
NH,OAc, 50 mM MgCl, 1 mM DTT), at 164,000g (SW-60, Beckman Coulter) for 
3h at 4°C. The gradients were analysed using a Gradient Station (Biocomp) with 
an Econo UV Monitor (Biorad) and a FC203B Fraction Collector (Gilson). 
Electron microscopy and image processing. Freshly prepared sample was 
applied to 2nm pre-coated Quantifoil R3/3 holey carbon supported grids and 
vitrified using a Vitrobot Mark IV (FEI Company) and visualized on a Titan 
Krios TEM (FEI Company) under low-dose conditions (about 20e per A?) at 
a nominal magnification of X75,000 with a nominal defocus between —1 jum and 
—3.5 um. 

The yeast SL-RNC-Dom34-Rlil data set was collected at 300 keV at a mag- 
nification of X 128,200 at the plane of CCD using an Eagle 4k * 4k CCD camera 
(FEI Company, 4,096 X 4,096 pixel, 15 jum pixel, 5 s/full frame) resulting in an 
image pixel size of 1.17A (object scale). The archaeal aPelota-aABCE! data set 
was collected at 200 keV at a magnification of X 148,721 at the plane of CCD using 
a faster TemCam-F416 CMOS camera (TVIPS GmbH, 4,096 x 4,096 pixel, 
15.6 jum pixel, 1 s/full frame), resulting in an image pixel size of 1.049 A (object 
scale). 

Data collection was facilitated by the semi-automated software EM-TOOLS 
(TVIPS GmbH), allowing manual selection of appropriate grid meshes and holes 
in the holey carbon film. The acquisition automatically performed a re-centering, 
drift and focus correction before the final spot scan series were taken. Long-term 
TEM instabilities in beam shift, astigmatism and coma were corrected by EM- 
TOOLS regularly (for example, every 45 min). Selected on the basis of power 
spectra quality, typically 70% of the recorded images were used for the subsequent 
reconstruction. 

Data processing was done using the SPIDER software package*’. For data pro- 
cessing from the TITAN KRIOS microscope we developed a new automated work- 
flow including import of the original .tif files, automated conversion into SPIDER 
and MRC format, CTF determination using the SPIDER TF ED command and 
automated particle selection based on the program Signature”. After initial par- 
ticle selection a second selection of the data set was done using a newly developed 
machine-learning algorithm (MAPPOS; http://arxiv.org/abs/1112.3173v2) that 
detects wrongly selected particles (‘non-particles’) such as contaminations, noise, 
carbon edges etc. An ensemble classifier was trained to categorise the data set based 
ona smaller training set containing good and non-particles, respectively. This was 
achieved by discriminatory features that were extracted from each image. 
Identified non-particles were then omitted from the data set. 

The 80S-SL-RNC-Dom34-Rlil data set was refined to a final resolution of 
7.2 A (Fourier shell correlation (FSC) cut-off 0.5). Refinement and sorting of 
144,500 particles was performed as described before'®'. The data set was first 
split into two subsets representing particles with and without Dom34 and Rlil 
(101,700). This subset was further sorted according to presence of P-site tRNA 
(45,700 particles with tRNA, 56,000 particles without tRNA). 

The entire 70S-aPelota-aABCE] data set contained 365,000 particles. The data 
set was sorted according to the presence of aABCE] and P-site tRNA‘*'. The data 
subset with aABCE1 still displayed heterogeneity regarding occupation with 
tRNA or aPelota in the A site, and it was further sorted accordingly. The final 
data set contained 51,000 particles and the final resolution was 6.6 A (ESC 0.5). 
Model building for aPelota/Dom34 and aABCE1/RIil. For the generation of 
protein homology models, the programs HHPRED“ and MODELLER* were 
used. For generating the Dom34 model in the Rlil-bound state, the existing 
crystal structure in complex with Hbs1 (PDB accession 3MCA)” and the model 
for ribosome-bound Dom34 (PDB accession 31ZQ)'® was used. The model for the 
T. kodakarensis aPelota was built using the X-ray structure (PDB accession 
3AGJ)**. Models for S. cerevisiae Rlil and P. furiosus aABCE1 were generated 
based on existing crystal structures (PDB accession 3BK7)°. Because in all elec- 
tron densities the secondary structure of proteins was visible, a highly reliable 
initial rigid body fit for aPelota/Dom34 aABCE1/Rlil could be performed using 
Coot and UCSF Chimera®*”*. On the basis of this initial fit, we used the programs 
DireX*’ and the molecular dynamics flexible fitting (MDFF) method***’ to 
interactively refine the models. A model for the S. cerevisiae ribosome**** was 
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used for molecular interpretation of Dom34/Hbs1-ribosome interactions. For 
analysing the archaeal complex a model for the P. furiosus 70S ribosome was built 


as described before 
46. 
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Abrupt acceleration of a ‘cold’ ultrarelativistic wind 


from the Crab pulsar 


F. A. Aharonian?, S. V. Bogovalov? & D. Khangulyan* 


Pulsars are thought to eject electron-positron winds that energize 
the surrounding environment, with the formation of a pulsar wind 
nebula’. The pulsar wind originates close to the light cylinder, the 
surface at which the pulsar co-rotation velocity equals the speed of 
light, and carries away much of the rotational energy lost by the 
pulsar. Initially the wind is dominated by electromagnetic energy 
(Poynting flux) but later this is converted to the kinetic energy of 
bulk motion’. It is unclear exactly where this takes place and to 
what speed the wind is accelerated. Although some preferred 
models imply a gradual acceleration over the entire distance from 
the magnetosphere to the point at which the wind terminates**, a 
rapid acceleration close to the light cylinder cannot be excluded*”. 
Here we report that the recent observations of pulsed, very high- 
energy y-ray emission from the Crab pulsar’° are explained by the 
presence of a cold (in the sense of the low energy of the electrons in 
the frame of the moving plasma) ultrarelativistic wind dominated 
by kinetic energy. The conversion of the Poynting flux to kinetic 
energy should take place abruptly in the narrow cylindrical zone of 
radius between 20 and 50 light-cylinder radii centred on the axis of 
rotation of the pulsar, and should accelerate the wind to a Lorentz 
factor of (0.5-1.0) X 10°. Although the ultrarelativistic nature of 
the wind does support the general model of pulsars, the require- 
ment of the very high acceleration of the wind in a narrow zone not 
far from the light cylinder challenges current models. 

The Crab pulsar is one of the brightest y-ray sources in the sky. Both 
the light curve and the energy spectrum have been studied”° in great 
detail by the Large Area Telescope on board NASA’s Fermi Gamma- 
ray Space Telescope (Fermi). The phase-averaged spectrum is best fitted 
by a power law with a photon index of « = 1.97 and an exponential cut- 
off at E, = 5.8 GeV (Fig. 1). Although modified ‘outer gap’ models"' do 
allow an extension of the spectrum up to 10GeV, the detection of 
pulsed, very high-energy (VHE) y-ray emission demands a different 
radiation component. The extrapolation of the fluxes reported by Fermi 
to the VHE domain as a power law with photon index « ~ 3.8, and the 
claim that such a formal fit is evidence that y-rays of gigaelectronvolt 
(GeV) energies have the same magnetospheric origin as those of 
teraelectronvolt (TeV) energies*””’, in fact requires a drastic revision 
of basic concepts used at present in magnetospheric models. Moreover, 
the assumption of a magnetospheric origin for radiation over the entire 
y-ray domain contradicts the essentially different light curves reported 
at GeV (ref. 10) and TeV (refs 7, 9) energies (unless the production sites 
of these two components are well separated), as well as the apparent 
tendency of spectral flattening above 100 GeV (Fig. 1). 

A natural and more plausible site of production of pulsed VHE 
y-rays is the ultrarelativistic wind illuminated by photons originating 
in the pulsar’s magnetosphere and/or the surface of the neutron star’’. 
In the case of the Crab pulsar, the phase-averaged flux of the pulsed 
(magnetospheric) component exceeds the flux of the thermal emission 
of the neutron star by two orders of magnitude. The combination of 
the hard spectral energy distribution of the pulsed emission and the 
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Figure 1 | Spectral energy distribution of y-ray radiation produced by the 
pulsar magnetosphere and by the pulsar wind. Symbols show the reported 
y-ray fluxes with 1-s.d. error bars’. Curves show theoretical predictions (this 
work). The Fermi Large Area Telescope points’® are best fitted by the function 
Fz =3.8X 10 3E°exp[—E/5.8 GeV]Jm~*s ! (dashed grey line). 
Assuming a slightly harder spectrum in the cut-off region, with 

Fp =3.8X 10 Sexpl—(E/7 GeV)°*] Jm~*s_! (solid grey line), the 
MAGIC ‘mono’ data points*® can be explained as well (because of large 
systematic uncertainties, the mono 100-GeV point, which differs by a factor of 
three from the flux measured by two MAGIC telescopes in the more reliable 
stereoscopic regime’, perhaps ought to be discarded). This spectrum is 
somewhat harder than that predicted by standard magnetospheric models, but 
does not challenge them'*'*. The inverse-Compton y-ray emission of the cold 
ultrarelativistic wind’’ can naturally explain the pulsed y-ray fluxes reported”? 
above 100 GeV. The solid light-blue, blue and green curves are calculated under 
the assumption of ‘instant’ acceleration of the wind at the fixed radius Ry. In 
principle, the acceleration can start earlier, but closer to the light cylinder the 
acceleration rate should be modest; otherwise it would lead to overproduction 
of inverse-Compton y-rays. Earlier acceleration is demonstrated by the dashed 
black curve, which is calculated under the assumption that acceleration starts at 
the light cylinder with a rate that increases in proportion with R® up to 

Ry = 30R;, where the Lorenz factor equals 5.5 X 10° (Supplementary 
Information). The solid red curve corresponds to the case in which the 
Poynting flux transformation takes place within the 20R,-50R, zone, assuming 
the wind’s acceleration rate to be independent of distance; the maximum 
Lorentz factor, achieved at 50R, is set to 10°. (The dotted grey line corresponds 
to the superposition of the red and solid grey lines and shows the transition 
between the two radiation components.) Because of the decrease in the density 
of target photons with distance, the main fraction of VHE radiation is produced 
at around 30R, with a Lorentz factor close to 5 X 10°. This explains the general 
similarity of the red curve to the instant-acceleration curves, apart from in the 
highest-energy region, where the sharp cut-off of the red curve is shifted to 
~500 GeV. 
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reduction of the Compton cross-section due to the Klein-Nishina 
effect means that the X-ray band is the main contributor to the 
Comptonization of the wind. The X-ray flux is well measured up to 
100 keV (ref. 14) and therefore the calculations of the inverse- 
Compton radiation depend basically on the site and the dynamics 
(speed) of transformation of the Poynting flux to kinetic energy of 
bulk motion. 

We assume that at a distance R,, from the pulsar, the wind is 
accelerated to the Lorentz factor I, (Fig. 2). Particles of the accelerated 
wind cannot move purely radially, because the wind should carry both 
the energy and the angular momentum lost by the pulsar. From the 
relation between the rotation energy (Eo) and angular momentum 
(M,o:) losses, Erot =@2Mrot, where Q is the angular velocity of the 
rotating sphere and a dot denotes a time derivative, we can define 
the trajectory of the wind particles. Indeed, each particle of the wind 
carries energy I” ‘mc’ and angular momentum J\mr , v, where m, r, 
and vare the particle’s mass, lever arm and speed, respectively, and c is 
the speed of light. Because I\ymr,vQ =I" ‘ymc’, particles in the 
accelerated wind move along straight lines, tangent to the light 
cylinder. Therefore, all photons emitted by the magnetosphere will 
collide with electrons of the wind at a non-zero angle, 0, resulting in 
inverse-Compton y-rays. The y-ray production efficiency depends on 
the electron Lorentz factor, the density of the target photons and the 
interaction angle. Because the cold wind carries almost the entire spin- 
down luminosity, even a tiny efficiency of about x ~ 10° © should be 
sufficient to produce detectable y-rays at an energy flux level of 
Fe= KE rot 40d? ~ 10-15 Jm~2s~—!, where d~ 6 X 10!’ m is the dis- 
tance to the Crab. 

Generally, the light curve of the target photons should be reflected in 
the time structure of the inverse-Compton y-ray signal; however, they 
cannot be identical, owing, for example, to the effects related to the 
specifics of the anisotropic inverse-Compton scattering. More impor- 
tantly, the geometrical effects may lead to non-negligible differences 
between the arrival times of the target photon and the secondary y-ray 
pulses (Fig. 3). For wind located close to the light cylinder, the y-ray 
signal seems shifted in time relative to the reported y-ray data, by 
At ~ 0.1T. By contrast, for wind acceleration at Ry = 30R,, the widths 
and the positions of the predicted and observed y-ray peaks (P1 and 
P2, respectively) are in very good agreement. However, whereas in 
the case of the isotropic wind the predicted P1/P2 flux ratio of the 
y-ray signal mimics the X-ray light curve’* (Fig. 3, black crosses), 
the reported y-ray data”? seem to correspond to a smaller ratio, 
P1/P2 < 1. This can be explained by there being a non-negligible wind 


anisotropy, which would introduce noticeable corrections to the shape 
of the y-ray light curve in general and to the P1/P2 ratio in particular 
(Fig. 3). The large uncertainties in the present y-ray data prevent us 
from a reaching a strong conclusion in this regard, but the improve- 
ment of the quality of VHE y-ray light curves should in future allow the 
strength and the character of the wind anisotropy to be decisively 
probed. 

GeV y-rays have a light curve’® that is essentially different from the 
reported VHE light curves”. This can be interpreted as a result of the 
production of GeV and TeV y-rays in regions well separated from each 
other. This conclusion is supported by the spectral energy distribution 
of the time-averaged GeV and TeV signals. As demonstrated in Fig. 1, 
the entire y-ray region can be considered a superposition of two sepa- 
rate components. Indeed, by introducing a new, flat-spectrum VHE 
component of the Comptonized wind, in addition to the nominal 
(magnetospheric) GeV component, the reported data in the GeV-to- 
TeV energy intervals can be smoothly matched. 

Although inverse-Compton y-rays are produced by mono-energetic 
electrons, the spectral energy distribution of y-rays in the range of tens 
to hundreds of GeV is quite flat. This is caused by the combination of 
effects related to the broad power-law distribution of seed photons and 
the transition of the Compton cross-section from the Thomson regime 
to the Klein-Nishina regime. On the other hand, the spectrum is 
expected to have a very sharp cut-off at E= I°,mc’. This not only 
can serve as a distinct feature for the identification of the wind origin 
of y-rays, but also should allow us to determine the Lorentz factor of 
the wind. In fact, the measurements available at present do not allow 
strong deviation of the Lorentz factor from 5 X 10°. We note that the 
calculations do not depend on the ‘magnetization parameter’ o (the 
ratio of the electromagnetic energy flux to the kinetic energy flux) as 
long as Ry>>R,. However, formally we can explain the pulsed VHE 
emission even for ¢ = 1. In this case, the acceleration should occur 
closer to the pulsar (Ry « 1/a"”) to compensate for the reduction in 
the wind’s kinetic energy. But in this case, the inverse-Compton y-ray 
radiation is expected to have quite different spectral and temporal 
features. 

The above estimates of the location of wind’s acceleration site and its 
Lorentz factor are quite robust, but they are obtained under the 
assumption that the transformation of the Poynting flux proceeds 
very quickly, at a specific radius between Ry, and Ry + dR, with 
OR,/Ry = 1. This is not an obvious assumption, but is instead a 
working hypothesis that the wind acceleration takes place in a narrow 
zone at the radius Ry ~ 30R,. We cannota priori exclude the possibility 
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Figure 2 | Complex comprising the pulsar 
magnetosphere, the ultrarelativistic wind and the 
pulsar wind nebula. Dense electron (e )-positron 
(e*) plasma produced in the pulsar magnetosphere 
by pair creation processes” initiates an electron- 
positron wind at the light cylinder, which has 
radius Ry ~ 10° m. Initially, the rotational energy 
lost by the pulsar, Ero: =5 x 10°! J s~!, is released 
mainly in the form of electromagnetic energy 
(Poynting flux) and the wind’s Lorentz factor 
therefore cannot be very large. Ata distance R,, the 
Poynting flux is converted to the kinetic energy of 
bulk motion (green zone), leading to an increase in 
the bulk-motion Lorentz factor to at least’® 

Ty ~ 10°. The termination of the wind by a 
standing reverse shock at Ry, ~ 3 X 101° m boosts 
the energy of the electrons to 10’* eV and 
randomizes their pitch angles’. The radiative 
cooling of these electrons through the synchrotron 
and inverse-Compton processes results in an 
extended non-thermal source*’*’, the Crab nebula. 
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Figure 3 | Formation of the pulsed VHE inverse-Compton y-ray signal in 
the wind of the Crab pulsar. a, Geometry of the inverse Compton scattering of 
magnetospheric X-rays by the electron—positron wind. b, Theoretical y-ray 
light curves of the wind presented together with the reported VHE data”. The 
velocity of the accelerated wind is tangential to the light cylinder (the direction 
of motion of electrons towards the observer is shown by the dashed green 
arrow). The interaction of electrons with the magnetospheric X-rays occurs 
predominantly at a distance R ~ R,,, where the wind is accelerated. Owing to 
the decrease in the target photon density with distance, the production of 
inverse-Compton y-rays is suppressed at larger distances. The target X-ray 
photon converted to a VHE y-ray photon reaches the observer earlier than an 
‘identical’ photon emitted directly towards the observer. Two factors contribute 
to the time shift, At: the up-scattered X-ray photon is emitted by the pulsar 
earlier, by a time 0T/2n, where T is the pulsar period; and it travels an additional 
path length of R,[1 — cos(@)]. For Ry>>R_, the time shift is negligibly small: 
At ~ —(T/4m)R,/R,,. For acceleration of the isotropic pulsar wind at 

R, = 30R,, the y-ray light curve (solid blue line) closely resembles the shape of 
the measured X-ray light curve’ (black crosses). For wind accelerated close to 
the light cylinder, the y-ray light curve is shifted and somewhat broadened by 
comparison with wind accelerated at Ry>>R,. The anisotropy of the wind can 
also strongly deform the y-ray light curve; in particular, it can change the ratio 
of the fluxes corresponding to peaks P1 and P2. The solid red line is calculated 
for an anisotropy factor proportional to the square of the sine of the angle 
between the line of sight and the direction of the magnetic momentum. This 
light curve seems to be in better agreement with the VERITAS’ and MAGIC’ 
points than the light curve corresponding to the fully isotropic wind, although 
the statistical and systematic uncertainties of observations (only Poisson error 
bars corresponding to the total count rates are shown on the plot) do not allowa 
definite conclusion in this regard. 


that the wind is gradually accelerated starting from the edge of the 
magnetosphere, but our numerical calculations show that this cannot 
be the case (Fig. 1 and Supplementary Information). This is because the 
gradual acceleration would lead to a large number of high-energy 
electrons being accelerated close to the light cylinder and, con- 
sequently, to the prolific production of inverse-Compton y-rays, in 
contradiction with the reported fluxes. Thus, the effective acceleration 
of the wind should start not much before the radius of 30R;, and not 
much beyond it. Such a case, assuming a linear acceleration rate of 
I(R) = Io + a(R/R, — 1) within the 20R,-50R, radial interval and a 
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maximum Lorenz factor of 10° achieved at 50R,, is shown in Fig. 1. The 
corresponding y-ray spectrum is smoother than the energy spectra 
predicted in the case of an instant acceleration, and better fits the 
VHE spectral points (Fig. 1) with the position of the sharp cut-off in 
the y-ray spectrum shifted to 500 GeV. Although the wind acceleration 
within the 20R;-50R, interval seems to be a physically more realistic 
scenario than an instant acceleration, this is still quite a narrow zone 
and the acceleration of the wind up to the Lorentz factor of 10° is 
therefore quite abrupt. This conclusion does not agree with those of 
alternative models, for example the so-called reconnection models of 
pulsar wind nebulae** based on the assumption that the transforma- 
tion of the Poynting flux to kinetic energy of bulk motion is a slow 
process that takes place over the entire region of the unshocked wind. 
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Wetting of flexible fibre arrays 


C. Duprat!, S. Protiére?*, A. Y. Beebe! & H. A. Stone! 


Fibrous media are functional and versatile materials, as demon- 
strated by their ubiquity both in natural systems such as feathers’ * 
and adhesive pads’ and in engineered systems from nanotextured 
surfaces® to textile products’, where they offer benefits in filtration, 
insulation, wetting and colouring. The elasticity and high aspect 
ratios of the fibres allow deformation under capillary forces, which 
cause mechanical damage*, matting*” self-assembly" or colour 
changes”, with many industrial and ecological consequences. 
Attempts to understand these systems have mostly focused on 
the wetting of rigid fibres’*”’ or on elastocapillary effects in planar 
geometries’® and on a fibre brush withdrawn from an infinite 
bath”. Here we consider the frequently encountered case of a liquid 
drop deposited on a flexible fibre array and show that flexibility, 
fibre geometry and drop volume are the crucial parameters that are 
necessary to understand the various observations referred to above. 
We identify the conditions required for a drop to remain compact 
with minimal spreading or to cause a pair of elastic fibres to 
coalesce. We find that there is a critical volume of liquid, and, hence, 
a critical drop size, above which this coalescence does not occur. We 
also identify a drop size that maximizes liquid capture. For both 
wetting and deformation of the substrates, we present rules that 
are deduced from the geometric and material properties of the 
fibres and the volume of the drop. These ideas are applicable to a 
wide range of fibrous materials, as we illustrate with examples for 
feathers, beetle tarsi, sprays and microfabricated systems. 

Owing to the numerous environmental and industrial applications 
of fibrous media, their wetting has been studied extensively. Most 
research focuses on drops or flow on individual rigid fibres, often in 
an array. However, in most applications, the elasticity of the fibres is 
important, as evidenced from the matting of feather barbules’, the 
shrinkage of porous fibre membranes’, strengthening of paper after 
drying”®, the clumping of the setae of beetle tarsi after release of tarsal 
oil’, or the collapse of micro- or nanopillar arrays'°’*. Therefore, we 
are motivated by the interaction of a mist of drops with a deformable, 
or flexible, array of fibres. The basic elastocapillary response is 
observed in the behaviour of a liquid drop on a pair of fibres, which 
is where we begin. 

For a perfectly wetting drop (that is, a drop that connects with the 
fibre with a zero contact angle) deposited on two parallel, rigid fibres, 
the minimization of surface energy yields three distinct drop shapes 
depending on the ratio of the distance between the fibres, 2d) 
(measured from their outer surfaces), and their diameter, 2r (Fig. 1b). 
As d/r is decreased, the drop evolves from a bridge to a barrel shape and 
then spreads out into a liquid column'*"”: a drop forms for do/r > \2, a 
column forms for do/r < 0.57 and there is non-uniqueness of the shape 
for 0.57 < do/r < J2. 

In this Letter, we investigate the behaviour of a perfectly wetting 
drop deposited onto two horizontal, flexible fibres that at one end are 
clamped, parallel to each other, a distance 2d) > 2\2r apart and at the 
other end are free to move (Methods and Fig. 1a). Our results can be 
extended to the case of partial wetting by adding as an additional 
parameter the effective contact angle, 0 < m/2 (Supplementary Fig. 4). 
We neglect gravitational effects because the fibres do not bend under 


their own weight and the drop sizes are smaller than the capillary 
length, /., above which gravitational effects become important. By 2d 
we denote the distance between the fibres at the drop location. When 
the drop is placed on the fibres close to the clamped ends, the fibres 
deflect inwards and the drop moves spontaneously towards the free 
ends, which are closer together (Fig. 1c and Supplementary Movie 2), 
as observed for a drop in a wedge”’. As the drop advances, the deflec- 
tion increases, that is, d/r continuously decreases. The drop accelerates, 
elongates and then spreads spontaneously between the fibres, drawing 
them together finally to form a liquid column between coalesced fibres. 

We performed a large set of experiments to characterize the final 
state as a function of the drop volume, V; the fibre length, L; and the 
ratio do/r (Methods). For every value of do/r, we find three different 
final states as L and V are varied (Fig. 2a-c). Whereas the final state 
(drop or column) of a finite volume of liquid deposited on two rigid 
fibres depends on only one parameter, that is, do/r, and is hence inde- 
pendent of the drop volume and fibre length, we find that the final 
equilibrium state for a drop on two flexible fibres depends on six 
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Figure 1 | Shape transitions ofa drop sitting on two parallel fibres. a. A drop 
of perfectly wetting liquid (silicone oil) of volume 2 ul deposited on two parallel, 
rigid glass fibres (radius, r = 0.145 mm) adopts three different shapes 
depending on the distance between the fibres: a ‘bridge’ (do/r = 2.5), a ‘barrel’ 
(do/r = 1.5) anda column (d)/r = 1). d,, critical distance at which the drop-to- 
column transition occurs. b, Experimental set-up used to investigate the 
behaviour of a drop deposited on two flexible fibres, which are clamped at one 
end and free to move at the other. Left: top and side views, recorded 
simultaneously using a mirror. Right: expected cross-sections for a concave 
liquid column and a convex drop. The direction of gravity is indicated by g. 
c, Typical experiment with do/r = 2.7, V= 1.5 pl and L = 4cm. The time 
between successive images is 25 s. When the drop is deposited on flexible fibres, 
the fibres deflect inward. The drop spontaneously moves towards the free ends 
of the fibres. At a given location, z,, the drop starts spreading and the fibres are 
drawn together. The final wet length is denoted L,. 
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Figure 2 | The three different final states of a drop between two flexible 
fibres. a—c, Top and side views of the final state obtained for do/r = 2.6; a fixed 
volume, V = 2 ul; and increasing length, L = 3, 3.5 and 4cm. The final state 
changes from one of no spreading to one of partial spreading to one of total 
spreading as L increases. d, Phase diagram of the different regimes for 


parameters: r; dy; L; the bending stiffness, B, of the fibre; the surface 
tension, y; and V. For short fibres (L < 2 cm in the system presented in 
Fig. 2) and almost all drop volumes, the fibres deflect slightly inwards 
and the drop moves towards the free ends, but there is no spreading 
(Fig. 2a and Supplementary Movie 1). For longer fibres, there is a range 
of drop volumes such that when we increase L, the deflection increases 
and the whole drop spreads into a column (‘total spreading’; Fig. 2c 
and Supplementary Movie 2). Alternatively, for sufficiently large 
volumes, we observe a state of ‘partial spreading’, where there is a 
liquid column with a smaller drop remaining at the edge (Fig. 2b 
and Supplementary Movie 3). We summarize in Fig. 2d our results 
in a phase diagram of V versus L, which suggests that a critical size of 
drops in a spray can trigger coalescence of a fibrous material. 

To understand the transitions between the different regimes in 
Fig. 2d, we first consider the case of long fibres, for which either partial 
or total spreading occurs for all volumes investigated. We fix the fibre 
length and measure the length, L,, along which the liquid spreads for 
various drop volumes (Fig. 3a). For small V, the whole drop spreads 
into a liquid column. As V increases, the column length increases until, 
above a critical volume, V., a drop remains at the wider end of the 
column and the length L, actually decreases (Fig. 3a). The existence ofa 
maximum spreading length here is a consequence of elasticity. 

We can understand the maximum spreading length, L, ax» reached 
at V. as a balance between elasticity and capillarity. There is a minimal 
distance, Lg,y, along which the dry portions of the fibres can be bent by 
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Figure 3 | Influence of the initial drop volume on the final state. 

a, Transition between total and partial spreading: evolution of the spreading 
length, Ls, with the volume, V, of the drop for do/r = 2.6 and L = 3.5cm. We 
observe an optimum (maximum L,) at a critical volume of V. = 1.5 ul. 

b, Transition between spreading and no spreading: evolution of the position of 


LETTER 


V>V, 


fe) O° 
Partial _ 
spreading 
° ° 


Lary 


L (cm) 
do/r = 2.1. Depending on L and V, three different regimes are observed: the 
drop either moves towards the ends without spreading (diamonds; a), spreads 
until partial depletion of the drop (circles; b) or spreads completely into a liquid 
column (squares; c). The solid and dashed curves correspond to equations (2) 
and (3), respectively. The vertical dashed line corresponds to L = Lary. 


capillary forces. This is determined by minimizing the total energy of 


the system”, yielding 
9Bae \'/4 
Lary = {>> 1 
" (rats) se 


where S(«) is a geometric factor evaluated approximately for a flat liquid 
column as S(% = 1/2) = m — 2. This length is the minimum length of 
the fibres beyond which collapse, or significant deformation, can occur. 
For a given d)/r ratio, Lary is constant and the maximum wet length, 
Lsmax) increases linearly with L, that is, Lsmnax = L — Lary, in agreement 
with our experiments (Supplementary Fig. 1). This elastocapillary 
balance results in an optimal, or critical, drop volume for which the 
spreading length is maximal: V. = A(@)Lymax» where A(«) is the 
column cross-section (A(1/2) ~ mr for a flat column). The boundary 
between total and partial spreading is then predicted to be 


V=V.=mr (L—Lay) (2) 


which is also in agreement with the experimental transition (Fig. 2d). 
When liquid is added to this maximum column (Supplementary Fig. 2), 
the configuration is unstable: the liquid forms a drop at the wider end of 
the column. Minimization of the surface energy causes the liquid to 
retract to form this spherical drop, decreasing the length of the column. 

To understand the critical drop size beyond which no spreading 
(Fig. 2a) and, hence, no fibre coalescence occurs, we measured the 
spacing, d,, at which the drop starts spreading. We find that the ratio 
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the drop at spreading, z,, with V for do/r = 1.9 (orange), 2.4 (purple), 2.7 (red) 
and 4.2 (blue). The position of the drop at spreading is independent of the fibre 
length and increases with increasing volume, spacing and fibre rigidity (that is, 
the bending modulus, which is proportional to 7‘). The solid lines correspond 
to the theoretical prediction (Supplementary Information, equation (2)). 
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d,/r = 1.11 £0.07 is constant, independent of V, L and d)/r. We 
conclude that spreading occurs when locally the spacing between the 
fibres is such that a liquid column is energetically favourable relative 
to a drop. This criterion for spreading is similar to that obtained 
theoretically for rigid fibres'’ and similar to that found in our experi- 
ments on rigid glass fibres: d,/r = 1.04 + 0.07. 

Next we estimated the critical drop volume responsible for this 
spreading, where liquid is captured in a column. The drop is in a barrel 
shape (Fig. 1a) and thus is pierced by the two fibres. The capillary force 
applied by the drop, F ~ 2yI*, where [* is the typical length of the 
contact line, brings the fibres together. Coalescence of fibres will 
therefore occur if F is great enough to achieve a deflection of do — d, 
along the fibres, that is, if the capillary torque, Fz,, where z, is the 
position of the drop, equals the elastic resisting torque, B(dp — d,)/z¢. 
For a given value of V and, hence, F, this estimate leads to a critical drop 
position, z,? x B(dp — d,)/F, below which no spreading will occur, 
which is also verified experimentally (Fig. 3b). For a fixed value of do, 
Z, increases with increasing volume and, hence, the capillary force 
decreases. Therefore, for a given fibre length, there is a minimal force, 
that is, a maximal drop volume, for spreading to occur, set by z, = L. 
The force F ~ 2yI* depends on the complex shape of the contact line, 
which has a typical length [* x 2nrd,v“* (Supplementary 
Information). Combining these results yields the critical volume for 
spreading 


o ydrL3_ \? 
(Pata) i 


where the constant / depends only on the complex shape of the drop 
(Supplementary Fig. 3); this result involves all of the geometric and 
material properties. The boundary between the regimes of spreading 
and no spreading, V= V,, is in agreement with the experiments 
(Fig. 2d). 

Because the final state of the drop placed on the fibre array depends 
on the six parameters r, do, L, B, y and V, we conclude by dimensional 
analysis that the system is characterized by three parameters, L/L, max» 
V/V. and do/r where Lymax=L—Lary (from equation (1)) and 
Vo= Tr" Ls max (from equation (2)). We define a phase diagram of 
the three possible final states in the space of the two parameters 
L/Lgmax and V/V, (Fig. 4a). First we identify a threshold, V/V. = 1, 
below which a drop will always totally spread, which maximizes the 
wetted length and the amount of trapped liquid. Second, for V/V. > 1, 
the spreading is partial: the remaining edge drop can be shed by any 
perturbation such as shaking, which results in a smaller amount of 
liquid being captured by the fibre array. The transition from spreading 
to no spreading (V > V,) is identified by equation (3) and depends on 
the parameter d)/r as reflected by the successive hyperbolic curves in 
Fig. 4a. For partial wetting, 0, the effective contact angle, should be 
included in equations (1), (2) and (3) (Supplementary Information 
and Supplementary Figs 4-7). All of the experimental data (symbols 
in Fig. 4a) obtained by varying all of the parameters are well within 
each regime defined by the model. 

This map allows us to predict the interaction of natural or engineered 
fibrous materials with a mist of drops. An example of a natural fibre 
array is a bird feather, which consists of well-ordered hair-like struc- 
tures (barbs and barbules) that produce hydrophobicity and thermal 
insulation’’. Small amounts of oil disrupt this arrangement by clump- 
ing adjacent barbules, affecting their water repellency and insulating 
properties and thus reducing the survival rate of oiled birds’**. We 
sprayed a polydisperse aerosol of oil on goose feathers and observed 
all three possible final states (Fig. 4b). Using our model system, we find 
that a volume of oil less than V. (here a drop radius less than 20 tm) 
spreads, thus clumping adjacent barbules and making the cleaning 
process difficult. Drops larger than V, (drop radius, 140 tm) do not 
spread and may be dislodged from the bird’s plumage. These results are 
in agreement with our map (Fig. 4a). Despite complex initial condi- 
tions (multiple fibres and/or drops, different wettabilities or surface 
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Figure 4 | Aerosol size and fibre matrix properties needed to collect, trap or 
displace a known volume of liquid. a, Map of the three spreading regimes as a 
function of the two dimensionless parameters L/L, max and V/V.. The solid 
curves show three limits for do/r = 2.4, 2.7 and 4.2 and the points show data for 
total spreading (asterisks), partial spreading (squares) and no spreading 
(circles) of silicone oil (total wetting, do/r =2.1; blue and purple points) and 
water (partial wetting, do/r = 3; black and grey points). Stars correspond to the 
three situations observed in b. b, Microscope pictures of goose feathers sprayed 
with oil (smaller drops have volumes of order 10 '*-10°%% m), showing no 
spreading (do/r = 4.8, L/Le max = 2, V/V. ~ 5; blue star in a), total spreading 
(do/r = 3.4, L/Lg max = 1.5 and L, = Ly max = 0.8 mm; white star in a) and partial 
spreading (do/r = 3.5, L/Lemax = 1.5, V/V- ~ 4; pink star in a) in agreement 
with our predictions. Scale bars, 500 um. 


roughnesses) the final states can thus be captured by our model and we 
can predict the main effects arising when a mist of drops interacts with 
a dilute fibre array. 

In addition, aerosol-removal filters, hairsprays, adhesive pads for 
insects and some applications in microstructure design require total 
spreading, that is, optimal coating of the fibres or maximal liquid 
capture. Conversely, fibres in living systems may have evolved a certain 
length or material properties to adapt to their environmental condi- 
tions. For example, we can now use our model to make quantitative 
estimates for various fibrous media and liquids (Supplementary Table 1) 
as reported in the universal map (Fig. 4a). For beetles, we predict that 
drops of optimal diameter 5m released from ventral pores travel 
along the setae and spread totally (that is, without liquid loss) where 
the tarsi contact a substrate; we predict that the collapse of pillars in 
microfabrication, observed during solvent evaporation’”"'’, could be 
controlled by depositing an optimum volume of liquid (drops of dia- 
meter 0.9-5.4\1m); and we predict that microstructures could be 
designed to respond (by changing colour) to aerosols of different drop 
sizes because light scattering is influenced by clustering of the micro- 
elements. These examples illustrate the wide range of flexible systems 
that could be controlled using elastocapillarity and an optimally chosen 
drop volume. 
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METHODS SUMMARY 

The glass fibres (radius, r = 0.145 mm; bending stiffnesses, B= 1 X 10 °Nm? 
andB=7X10’Nm’; length, 2cm < L <5.5 cm) are clamped at one end, sepa- 
rated bya distance 2dp (do/r = 1.9, 2.1, 2.6 and 4.3). A drop of silicone oil (viscosity, 
n=97mPas; density, p = 970kg m *; surface tension, y = 0.021 Nm !) of 
controlled volume V (0.48 pl << V< 2.55 ul) precise to 5% for large volumes and 
to 10% for smaller volumes, is deposited onto the fibres with a micropipette, close 
to the clamped edge. The positions of the front, z(t), and the rear, z,(t), of the drop 
(relative to the clamped end at z = 0), as well as the distance between the fibres, d, 
are recorded from the top with a digital camera. A mirror placed at 45° allows us to 
capture a simultaneous side view of the drop and measure its size, H(z, t). We 
define the average position of the drop as z, = (zp + Z,)/2, and its length is] = zp — z,. 
The capillary length, |. = (y/pg)"”, is the length beyond which gravitational effects 
become more important than capillary effects. Here ], = 1.5mm. 
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Recent contributions of glaciers and ice caps to sea 


level rise 


Thomas J acob't, John Wahr!, W. Tad Pfeffer?”* & Sean Swenson* 


Glaciers and ice caps (GICs) are important contributors to present- 
day global mean sea level rise’*. Most previous global mass balance 
estimates for GICs rely on extrapolation of sparse mass balance 
measurements’** representing only a small fraction of the GIC 
area, leaving their overall contribution to sea level rise unclear. 
Here we show that GICs, excluding the Greenland and Antarctic 
peripheral GICs, lost mass at a rate of 148+30Gtyr' from 
January 2003 to December 2010, contributing 0.41 + 0.08 mm yr" ' 
to sea level rise. Our results are based on a global, simultaneous 
inversion of monthly GRACE-derived satellite gravity fields, from 
which we calculate the mass change over all ice-covered regions 
greater in area than 100 km”. The GIC rate for 2003-2010 is about 
30 per cent smaller than the previous mass balance estimate that 
most closely matches our study period’. The high mountains of 
Asia, in particular, show a mass loss of only 4+ 20Gtyr ‘ for 
2003-2010, compared with 47-55 Gtyr’‘ in previously published 
estimates”*. For completeness, we also estimate that the Greenland 
and Antarctic ice sheets, including their peripheral GICs, con- 
tributed 1.06 + 0.19 mm yr" to sea level rise over the same time 
period. The total contribution to sea level rise from all ice-covered 
regions is thus 1.48 + 0.26 mm yr‘, which agrees well with inde- 
pendent estimates of sea level rise originating from land ice loss and 
other terrestrial sources®. 

Interpolation of sparse mass balance measurements on selected 
glaciers is usually used to estimate global GIC mass balance’. 
Models are also used*’, but these depend on the quality of input 
climate data and include simplified glacial processes. Excluding 
Greenland and Antarctic peripheral GICs (PGICs), GICs have 
variously been reported to have contributed 0.43-0.51mm yr ' to 
sea level rise (SLR) during 1961-2004*”*, 0.77mmyr ' during 
2001-20048, 1.12 mm yr‘ during 2001-2005' and 0.95 mm yr’ ' during 
2002-20067. 

The Gravity Recovery and Climate Experiment (GRACE) satellite 
mission’ has provided monthly, global gravity field solutions since 
2002, allowing users to calculate mass variations at the Earth’s sur- 
face'®. GRACE has been used to monitor the mass balance of selected 
GIC regions''* that show large ice mass loss, as well as of Antarctica 
and Greenland”’. 

Here we present a GRACE solution that details individual mass 
balance results for every region of Earth with large ice-covered areas. 
The main focus of this paper is on GICs, excluding Antarctic and 
Greenland PGICs. For completeness, however, we also include results 
for the Antarctic and Greenland ice sheets with their PGICs. GRACE 
does not have the resolution to separate the Greenland and Antarctic 
ice sheets from their PGICs. All results are computed for the same 8-yr 
time period (2003-2010). 

To determine losses of individual GIC regions, we cover each region 
with one or more ‘mascons’ (small, arbitrarily defined regions of 
Earth) and fit mass values for each mascon (ref. 16 and Supplemen- 
tary Information) to the GRACE gravity fields, after correcting for 


hydrology and for glacial isostatic adjustment (GIA) computed using 
the ICE-5G deglaciation model. We use 94 monthly GRACE solutions 
from the University of Texas Center for Space Research, spanning 
January 2003 to December 2010. The GIA corrections do not include 
the effects of post-Little Ice Age (LIA) isostatic rebound, which we 
separately evaluate and remove. All above contributions and their 
effects on the GRACE solutions are discussed in Supplementary 
Information. 

Figure 1 shows mascons for all ice-covered regions, constructed 
from the Digital Chart of the World’’ and the Circum-Arctic Map 
of Permafrost and Ground-Ice Conditions’*. Each ice-covered region 
is chosen as a single mascon, or as the union of several non-overlapping 
mascons. We group 175 mascons into 20 regions. Geographically iso- 
lated regions with glacierized areas less than 100km* in area are 
excluded. Because GRACE detects total mass change, its results for 
an ice-covered region are independent of the glacierized surface area 
(Supplementary Information). 

Mass balance rates for each region are shown in Table 1 (see 
Supplementary Information for details on the computation of the rates 
and uncertainties). We note that Table 1 includes a few positive rates, 
but none are significantly different from zero. We also performed an 
inversion with GRACE fields from the GFZ German Research Centre 
for Geosciences and obtained results that agreed with those from the 
Center for Space Research (Table 1) to within 5% for each region. 

The results in Table 1 are in general agreement with previous GRACE 
studies for the large mass loss regions of the Canadian Arctic’? and 
Patagonia’, as well as for the Greenland and Antarctic ice sheets with 


Figure 1 | Mascons for the ice-covered regions considered here. Each 
coloured region represents a single mascon. Numbers correspond to regions 
shown in Table 1. Regions containing more than one mascon are outlined with 
a dashed line. 


1Department of Physics and Cooperative Institute for Environmental Studies, University of Colorado at Boulder, Boulder, Colorado 80309, USA. Institute of Arctic and Alpine Research, University of 
Colorado at Boulder, Boulder, Colorado 80309, USA. 3Department of Civil, Environmental, and Architectural Engineering, University of Colorado at Boulder, Boulder, Colorado 80309, USA. 4National Center 
for Atmospheric Research, Boulder, Colorado 80305, USA. Present address: Bureau de Recherches Géologiques et Miniéres, Orléans 45060, France. 


514 | NATURE | VOL 482 | 23 FEBRUARY 2012 


©2012 Macmillan Publishers Limited. All rights reserved 


Table 1 | Inverted 2003-2010 mass balance rates 


Region Rate (Gtyr~) 
1. Iceland =—(122 
2. Svalbard —S22 
3. Franz Josef Land O+2 
4. Novaya Zemlya -4+2 
5. Severnaya Zemlya -l22 
6. Siberia and Kamchatka 2+10 
7. Altai 3+6 
8. High Mountain Asia —-4+20 
8a. Tianshan =5 26 
8b. Pamirs and Kunlun Shan =125 
8c. Himalaya and Karakoram —5 26 
8d. Tibet and Qilian Shan 727 
9. Caucasus L#3 
10. Alps —2+3 
11. Scandinavia 325 
12. Alaska —-46+7 
13. Northwest America excl. Alaska 5+8 
14. Baffin Island —3o 25 
15. Ellesmere, Axel Heiberg and Devon Islands —34+6 
16. South America excl. Patagonia =—62 12 
17. Patagonia =—23 29 
18. New Zealand 243 
19. Greenland ice sheet + PGICs =222 +9 
20. Antarctica ice sheet + PGICs =165 272 
Total —536+93 
GICs excl. Greenland and Antarctica PGICs —148 + 30 
Antarctica + Greenland ice sheet and PGICs —384+71 


Total contribution to SLR 


SLR due to GICs excl. Greenland and Antarctica PGICs 
SLR due to Antarctica + Greenland ice sheet and PGICs 


1.48 + 0.26 mm yr~? 


0.41 + 0.08 mm yr 
1.06 +0.19 mm yr“? 


Uncertainties are given at the 95% (2c) confidence level. 


their PGICs’’. Our results for Alaska also show considerable mass loss, 
although our mass loss rate is smaller than some previously published 
GRACE-derived rates that used shorter and earlier GRACE data spans 
(Supplementary Information). The global GIC mass balance, exclud- 
ing Greenland and Antarctic PGICs, is —148+30Gtyr ', con- 
tributing 0.41 + 0.08 mmyr ' to SLR. 

Mass balance time series for all GIC regions are shown in Fig. 2. The 
seasonal and interannual variabilities evident in these time series have 
contributions from ice and snow variability on the glaciers, as well as 
from imperfectly modelled hydrological signals in adjacent regions 
and from random GRACE observational errors. Interannual variability 
can affect rates determined over short time intervals. Figure 2 and 
Supplementary Table 2 show that there was considerable interannual 
variability during 2003-2010 for some of the regions, especially High 
Mountain Asia (HMA). The HMA results in Supplementary Table 2 
show that this variability induces large swings in the trend solutions 
when it is fitted to subsets of the entire time period. These results suggest 
that care should be taken in extending the 2003-2010 results presented 
in this paper to longer time periods. 

For comparison with studies in which PGICs are included with 
GICs, we upscale our GIC-alone rate to obtain a GIC rate that includes 
PGIC, based on ref. 3 (Supplementary Information). The result is that 
GICs including PGICs lost mass at a rate of 229+82Gtyr ' 
(0.63 + 0.23 mm yr ~ ' SLR), and that the combined ice sheets without 
their PGICs lost mass at 303+ 100Gtyr * (0.84+0.28mmyr * 
SLR). Although no other study encompasses the same time span, 
published non-GRACE estimates for GICs plus PGICs are larger: 
0.98+0.19mmyr ' over 2001-2004%, 1.41+0.20mmyr' over 
2001-2005' and 0.765mmyr ‘ (no uncertainty given) over 2006- 
2010”. These differences could be due to the small number of mass 
balance measurements those estimates must rely on, combined with 
uncertain regional glacier extents. In addition, there are indications 
from more recent non-GRACE measurements that the GIC mass loss 
rate decreased markedly beginning in 2005”°. 

Our results for HMA disagree significantly with previous studies. 
A recent GRACE-based study° over 2002-2009 yields significantly 
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Figure 2 | Mass change during 2003-2010 for all GIC regions shown in 
Fig. 1 and Table 1. The black horizontal lines run through the averages of the 
time series. The grey lines represent 13-month-window, low-pass-filtered 
versions of the data. Time series are shifted for legibility. Modelled 
contributions from GIA, LIA and hydrology have been removed. 
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larger mass loss for HMA than does ours; we explain why the result of 
ref. 5 may be flawed in Supplementary Information. Conventional 
mass balance methods have been used to estimate a 2002-2006 rate 
of —55Gtyr ' for this entire region’, with —29Gtyr over the 
eastern Himalayas alone, by contrast with our HMA estimate, of 
—4+20Gt yr (Table 1). We show results for the four subregions 
of HMA (Fig. 3) in Table 1. 

This difference prompts us to examine this region in more detail. 
GRACE mass trends show considerable mass loss across the plains of 
northern India, Pakistan and Bangladesh, centred south of the glaciers 
and at low elevations (Fig. 3a, b). Some of the edges of this mass loss 
region seem to extend over adjacent mountainous areas to the north, 
but much of that, particularly above north-central India, is leakage of 
the plains signal caused by the 350-km Gaussian smoothing function 
used to generate the figure. The plains signal has previously been 
identified as groundwater loss'®*’. To minimize leakage in the HMA 
GIC estimates, additional mascons are chosen to cover the plains 
(Fig. 3a), the sum of which gives an average 2003-2010 water loss rate 
of 35Gtyr ‘. Our plains results are consistent with the results of refs 
16 and 21, which span shorter time periods. 

The lack of notable mass loss over glacierized regions is consistent 
with our HMA mascon solutions that indicate relatively modest losses 
(Table 1). We simulate what the ice loss rates predicted by ref. 2 would 
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Figure 3 | HMA mass balance determination. a, Topographic map overlaid 
with the HMA mascons (crosses) and India plain mascons (dots); the dashed 
lines delimit the four HMA subregions (labelled as in Table 1). b, GRACE mass 
rate corrected for hydrology and GIA and smoothed with a 350-km Gaussian 
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Longitude east 


Longitude east 


look like in the GRACE results. We use those rates to construct 
synthetic gravity fields and process them using the same methods 
applied to the GRACE data, to generate the trend map shown in 
Fig. 3c. It is apparent that an ice loss of this order would appear in 
the GRACE map as a large mass loss signal centred over the eastern 
Himalayas, far larger in amplitude and extent than the GRACE results 
in that region (compare Fig. 3b with Fig. 3c). 

It is reasonable to wonder whether a tectonic process could be 
causing a positive signal in the glacierized region that offsets a large 
negative glacier signal in HMA. To see what this positive rate would have 
to look like, we remove the simulated gravity field (based on ref. 2) from 
the GRACE data and show the resulting difference map in Fig. 3d. If the 
ice loss estimate were correct, the tectonic process would be causing an 
anomalous mass increase over the Himalayas of ~3cm yr ' equivalent 
water thickness, equivalent to ~1cmyr | of uncompensated crustal 
uplift. Although we cannot categorically rule out such a possibility, it 
seems unlikely. Global Positioning System and levelling observations in 
this region indicate long-term uplift rates as large as 0.5-0.7 cm yr! in 
some places**”’. But it is highly probable that any broad-scale tectonic 
uplift would be isostatically compensated by an increasing mass 
deficiency at depth, with little net effect on gravity* and, consequently, 
no significant contribution to the GRACE results. The effects of com- 
pensation are evident in the static gravity field. Supplementary Fig. 4 
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smoothing function, overlaid with the HMA mascons. w.e., water equivalent. 
c, Synthetic GRACE rates that would be caused bya total mass loss of 55 Gt yr 
over HMA mascons, with 29 Gt yr! over the eastern Himalayas, after ref. 2. 

d, The difference between b and c. 
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shows the free-air gravity field, computed using a 350-km Gaussian 
smoothing function (used to generate Fig. 3) applied to the EGM96 
mean global gravity field”. The topography leaves no apparent sig- 
nature on the static gravity field at these scales, indicating near-perfect 
compensation. 

For a solid-Earth process to affect GRACE significantly, it must be 
largely isostatically uncompensated, which for these broad spatial 
scales would require characteristic timescales of the same order or less 
than the mantle’s viscoelastic relaxation times (several hundred to a 
few thousand years). One possible such process might be the ongoing 
viscoelastic response of the Earth to past glacial unloading. We have 
investigated this effect, as well as possible contributions from erosion, 
and find that neither is likely to be important (Supplementary 
Information). 

Another possible explanation for the lack of a large GRACE HMA 
signal is that most of the glacier melt water might be sinking into the 
ground before it has a chance to leave the glaciated region, thus causing 
GRACE to show little net mass change. Some groundwater recharge 
undoubtedly does occur, but it seems unlikely that such cancellation 
would be this complete. Much of HMA, for example, is permafrost, so 
local storage capacity is small (see the Circum-Arctic Map of 
Permafrost and Ground-Ice Conditions; http://nsidc.org/fgdc/maps/ 
ipa_browse.html). Therefore, although there would be surface melt, 
the frozen ground would inhibit local recharge and there would be 
little ability to store the melt water locally. How far the water might 
have to travel before finding recharge pathways, we do not know. It is 
true that some rivers originating in portions of HMA do not reach the 
sea. Most notable are the Amu Darya and Syr Darya, which historically 
feed the Aral Sea but have been diverted for irrigation. Any fraction of 
that diverted water that ends up recharging aquifers will not directly 
contribute to SLR. However, the irrigation areas lie well outside our 
HMA mascons, and so even if there is notable recharge it is unlikely to 
affect the HMA mascon solutions significantly. 

Our emphasis here is on GICs; the Greenland and Antarctic ice 
sheets have previously been well studied with GRACE". But for com- 
parison with non-GRACE global estimates, we combine our GIC results 
with our estimates for Greenland plus Antarctica to obtain a total SLR 
contribution from all ice-covered regions of 1.48+0.26mmyr ' 
during 2003-2010. Within the uncertainties, this value compares 
favourably with the estimate of 18+0.5mmyr ' for 2006 from 
ref 4. However, there are regional differences between these and prior 
results, which need further study and reconciliation. 

SLR from the addition of new water can be determined from 
GRACE alone as well as by subtracting Argo steric heights from 
altimetric SLR measurements’. The most recent new-water SLR 
estimate, comparing the two methods, is 1.3+0.6mmyr ' for 
2005-2010°, which agrees with our total ice-covered SLR value to 
within the uncertainties. The difference, 0.2 + 0.6 mm yr ', could rep- 
resent an increase in land water storage outside ice-covered regions, 
but we note that it is not significantly different from zero. 


METHODS SUMMARY 


GRACE solutions consist of spherical harmonic (Stokes) coefficients and are used 
to determine month-to-month variations in Earth’s mass distribution””’. We use 
monthly values of C29 (the zonal, degree-2 spherical harmonic coefficient of the 
geopotential) from satellite laser ranging’®, and include degree-one terms”. 

To determine mass variability for each mascon, we find the set of Stokes coeffi- 
cients produced by a unit mass distributed uniformly across that mascon. We fit 
these sets of Stokes coefficients, simultaneously, to the GRACE Stokes coefficients, 
to obtain monthly mass values for each mascon. This method is similar to prev- 
iously published mascon methods”, though here we fit to Stokes coefficients 
rather than to raw satellite measurements and we do not impose smoothness 
constraints. To determine the optimal shape and number of mascons in a region, 
we construct a sensitivity kernel for several possible configurations, and choose the 
configuration that optimizes that kernel and minimizes the GRACE trend residuals 
(Supplementary Fig. 1c). 
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The average of two land surface models is used to correct for hydrology, and the 
model differences are used to estimate uncertainties (Supplementary Information). 

LIA loading corrections have been previously derived for Alaska’? and 
Patagonia”, and equal 7 and 9 Gt yr‘, respectively. These numbers are subtracted 
from our Alaska and Patagonia inversions. For other GIC regions, where LIA 
characteristics are not well known, we estimate an upper bound for the correction 
by constructing a GIA model that tends to maximize the positive LIA gravity 
trend. Ofall the additional GIC regions, only HMA has a predicted LIA correction 
that reaches 1 Gtyr’ ’. There, the model suggests we remove 5Gtyr ‘ from our 
inverted result. But because the LIA correction in this region is likely to be an 
overestimate (Supplementary Information), our preferred result splits the differ- 
ence (Supplementary Table 1), and we use that difference to augment the total 
HMA uncertainty. 
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The microRNA miR-34 modulates ageing and 
neurodegeneration in Drosophila 
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Human neurodegenerative diseases have the temporal hallmark of 
afflicting the elderly population. Ageing is one of the most prominent 
factors to influence disease onset and progression’, yet little is known 
about the molecular pathways that connect these processes. To 
understand this connection it is necessary to identify the pathways 
that functionally integrate ageing, chronic maintenance of the brain 
and modulation of neurodegenerative disease. MicroRNAs (miRNA) 
are emerging as critical factors in gene regulation during develop- 
ment; however, their role in adult-onset, age-associated processes is 
only beginning to be revealed. Here we report that the conserved 
miRNA miR-34 regulates age-associated events and long-term brain 
integrity in Drosophila, providing a molecular link between ageing 
and neurodegeneration. Fly mir-34 expression exhibits adult-onset, 
brain-enriched and age-modulated characteristics. Whereas mir-34 
loss triggers a gene profile of accelerated brain ageing, late-onset 
brain degeneration and a catastrophic decline in survival, mir-34 
upregulation extends median lifespan and mitigates neurodegenera- 
tion induced by human pathogenic polyglutamine disease protein. 
Some of the age-associated effects of miR-34 require adult-onset 
translational repression of Eip74EF, an essential ETS domain tran- 
scription factor involved in steroid hormone pathways. Our studies 
indicate that miRNA-dependent pathways may have an impact on 
adult-onset, age-associated events by silencing developmental genes 
that later have a deleterious influence on adult life cycle and disease, 
and highlight fly miR-34 as a key miRNA with a role in this process. 

Recent evidence reveals that miRNA pathways are important in the 
adult nervous system, notably in the maintenance of neurons and in the 
regulation of genes and pathways associated with neurodegenerative 
disease”. Given these findings, we considered that there may be a 
fundamental role for select miRNAs in ageing. We examined flies 
carrying a hypomorphic mutation in loquacious (logs), a key gene in 
fly miRNA processing* (Supplementary Fig. 1a). Flies bearing the 
logs! 00791 mutation were viable, but detailed examination indicated a 
significantly shortened lifespan (Supplementary Fig. 1b). Further ana- 
lysis indicated that logs” flies showed late-onset brain morphological 
deterioration: although normal as young adults, by 25 days logs'°””! 
flies developed large vacuoles in the retina and lamina of the brain 
(Supplementary Fig. 1c). Although developmental processes may con- 
tribute to shortened lifespan, the adult-onset brain degeneration of 
logs”! mutants indicated that one or more specific miRNAs may 
be critically involved in age-associated events impacting on long-term 
brain integrity. 

To explore this question, we determined whether specific miRNAs 
displayed age-modulated expression in the brain. RNA was isolated 
from dissected brains of adult flies of young (3 days), mid (30 days) and 
old time points (60 days). Using an array for Drosophila miRNAs, 29 
were expressed in the adult brain (Fig. 1a). Whereas most miRNAs 
maintained a steady level or decreased with age, one miRNA, mir-34, 


increased (Fig. la). Small RNA northern blot analysis confirmed that 
mir-34 expression was barely detectable during development, but 
became high in the adult and was further upregulated with age (Sup- 
plementary Fig. 2a, b). Expression of mir-34 was affected in logs”?! 
flies (Supplementary Fig. 1d). miR-34 falls into a category of 
Drosophila miRNAs whose processing requires the exoribonuclease 
nibbler (nbr)**. In the adult, mature miR-34 displayed three major 
differentially sized forms (24 nucleotides, 22 nucleotides and 21 
nucleotides) with a uniform 5’ end, descending by single nucleotides 
at the 3’ end which result from nbr-mediated trimming; only isoform c 
became upregulated with age (Supplementary Fig. 2c and Fig. 1b, c; see 
also refs 5-7). 

miR-34 is a markedly conserved miRNA, with orthologues in fly, 
Caenorhabditis elegans, mouse and human showing identical seed 
sequence (Supplementary Fig. 2d). To define miR-34 function, flies 
deleted for the gene were generated (Supplementary Fig. 3a). The 
resulting mir-34 mutant flies retained normal wild-type expression 
of neighbouring genes, but selectively lacked mir-34 (Supplementary 
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Figure 1 | Drosophila mir-34 expression is upregulated with age. a, Heat 
map of fold-change of Drosophila miRNAs in brains aged 3 days, 30 days and 
60 days. Twenty-nine miRNAs (shown) were flagged present out of a total of 
seventy-eight. One-way analysis of variance defined significance for each 
miRNA over all time points (***P < 0.001; = 3 replicates). Genotype: iso31. 
b, Fly miR-34 isoform c shows age-modulated expression in fly heads. Left 
panels: miR-34 shows three major mature forms (labelled a, b and c), but only 
isoform c increases with age. Right panels: quantification of miR-34 isoforms 
with age. n = 3 independent experiments; signal density of all isoforms 
normalized at the same time point to 2S rRNA loading control. *P < 0.01; 
**P < 0.001; one-way analysis of variance, with post test: Tukey’s multiple 
comparison test. Genotype: 5905. c, Sequences of miR-34 isoforms are 
generated through nbr-dependent 3’-end trimming. 
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Fig. 3b, c). To interrogate age-associated phenotypes carefully, we 
generated mir-34 null flies in the same uniform homogeneous genetic 
background (see Methods). mir-34 mutants displayed no obvious 
developmental defects, consistent with its adult-onset expression. 
However, detailed examination of adult animals indicated that mir-34 
mutants, although showing normal adult appearance and early survival, 
displayed a catastrophic decline in viability just after 30 days (Fig. 2a 
and Supplementary Table 4). Analysis of age-associated functions 
revealed that young mutants (3 days) had normal locomotion and stress 
resistance, but by 20 days the mutants had dramatic climbing deficits 
and were markedly stress-sensitive compared to age-matched controls 
(Fig. 2b). Because mir-34 expression was brain-enriched, we also 
examined the brain. Typically, older flies show sporadic, age- 
correlated vacuoles in the brain—a morphological hallmark of neural 
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Figure 2 | miR-34 modulates age-associated processes. a, mir-34 mutant flies 
have a shortened lifespan (control: 64 days median, 90 days maximal lifespan; 
mir-34: 40 days medium, 64 days maximal lifespan; P < 0.0001, log-rank test). 
Mean + s.e.m., nm = 240 male flies per curve. Genotypes: control, 5905; mir- 
34-'~, miR-34 null-1 in 5905 homogenous genetic background. b, mir-34 
mutant flies have late-onset behavioural deficits. Left: for locomotion 
behaviour, mir-34 mutant flies show normal climbing at 3 days. At 20 days, 
50 + 3.4% mir-34 mutant flies fail to climb; in contrast, only 22.1 + 2.4% of 
control flies have defective climbing. Mean ~ s.e.m. of 3 experiments, n = 120- 
140 male flies per experiment. Right panel: for stress resistance, mir-34 mutant 
flies have normal resistance to heat stress at 3 days. mir-34 mutant flies become 
markedly sensitive to heat shock with age, such that at 20 days, only 

27.5 + 3.8% survive after heat stress. In contrast, 76.7 + 9.6% of control flies 
survive after the same treatment. Mean = s.e.m. of 3 experiments, n = 120-140 
male flies. ***P < 0.0001 (two-way analysis of variance). Genotypes as in 

a. c, mir-34 mutant flies show age-associated brain degeneration. Top-left 
panel: mir-34 mutant flies have normal brain morphology at 3 days. Major 
anatomical structures: CB, central brain; La, lamina; Lo; lobula; LoP, lobula 
plate; Me, medulla; Rt, retina. At 3 days, control flies have normal brain 
morphology (not shown), but develop a small number of sporadic vacuoles at 
30 days (top-right panel, arrowheads). Middle panel: aged mir-34 mutants 
(30 days) show striking vacuoles in the medulla (arrows) and other regions of 
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deterioration®. mir-34 mutants were born with normal brain morpho- 
logy, but showed dramatic vacuolization with age, indicative of loss of 
brain integrity (Fig. 2c). Rescue with a 9-kb genomic DNA fragment 
containing mir-34 and its endogenous cis-regulatory elements (Sup- 
plementary Fig. 3a, b) partially restored the age-associated expression 
of mir-34 to mir-34 null flies in the same homogeneous genetic back- 
ground (Supplementary Fig. 3d). Although rescue was not complete, 
indicative of a complexity in genomic elements that regulate mir-34, 
rescue was sufficient to mitigate the mutant effects, indicating that 
miR-34 function normally underlies these age-associated aspects 
(Supplementary Table 1). 

These data indicated that mir-34 mutants were normal as young 
adults, but with age developed deficits reflective of much older animals, 
including loss of locomotion, stress sensitivity and brain deterioration, 
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the brain (arrowheads). Bottom: the number of vacuoles in mir-34 mutants is 
significantly higher than in controls (22.2 + 1.8 versus 1.5 + 0.3 in medulla; 
19.2 + 2.5 versus 7.0 + 0.9 in other regions of the brain; **P < 0.001, one-way 
analysis of variance, with post test: Tukey’s multiple comparison test). 

Mean ~ s.e.m., 1 = 10 independent male fly brains. Genotypes as in a. Scale 
bar: 0.1 mm. d, mir-34 mutant flies have a transcriptional profile indicative of 
accelerated ageing. Top panel: 173 age-correlated probe sets were defined from 
a transcriptional profile of fly brains at 3 days, 30 days and 60 days of age. 
Arrowheads indicate time points (3 days and 20 days) at which mir-34 mutants 
and controls were compared. Genotype: iso31 flies used for transcriptional 
profiles of normal ageing brains. n = 3 biological replicates for each time point. 
P=0.001, false discovery rate (FDR) = 0.062, linear regression model. Bottom 
panel: scatter plot illustrates the relative expression of 173 probe sets, which 
shows a significant difference between mir-34 mutants and age-matched 
controls (P = 0.006, two-sample, paired Wilcoxon test). Whereas the pattern 
for positively correlated probe sets (red), indicated by the contour lines, is 
significantly different (P = 0.0001) between the two genotypes, and tends to 
show higher expression in mir-34 mutants compared to controls, it is not for 
negatively correlated probe sets (blue) (P = 0.9583). Contour lines indicate that 
positively correlated probe sets tend to show higher expression in mir-34 
mutants compared to controls. Genotypes as ina. n = 5 biological replicates for 
each time point. 
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coupled with shortened lifespan. We therefore hypothesized that loss 
of mir-34 accelerated brain ageing. To address this, we transcriptionally 
profiled the fly brain (3 days, 30 days and 60 days) from wild-type 
animals. On the basis of a linear regression model’, we extracted 173 
probe sets from this profile the expression of which was tightly corre- 
lated with the progression of normal ageing (Fig. 2d and Supplemen- 
tary Tables 2 and 3). We next made another set of brain transcriptional 
profiles for mir-34 mutants and controls of matched chronological age 
(3 days and 20 days). We measured relative changes of these probe sets 
between 3 days and 20 days within each genotype, and compared the 
extent of such changes between mir-34 mutants and controls. This 
indicated that the overall pattern of these probe sets was significantly 
different between the two genotypes (P = 0.006, two-sample, paired 
Wilcoxon test; Fig. 2d). In particular, most positively correlated probe 
sets displayed a faster pace of increase in mir-34 mutants compared 
to controls—thus showing accelerated age-associated expression 
changes in mir-34 mutants (Fig. 2d). This result, combined with the 
physiological and histological evidence of more rapid loss of age- 
associated functions, suggested that mir-34 mutants were undergoing 
accelerated brain ageing. 

miRNAs function by binding to the 3’ UTRs of target mRNAs and 
often result in downregulation of protein translation. We therefore 
reasoned that age-associated activities of miR-34 might be mediated 
through silencing of critical targets that have a negative impact on the 
adult animal. miRNA-target prediction algorithms indicated miR-34 
binding sites within the 3’ UTR of the Eip74EF gene; notably, these 
binding sites were conserved in the orthologous Eip74EF genes from 
different Drosophila species (Supplementary Fig. 4a). We confirmed 
the miR-34 interaction through mutations in the seed sequences of the 
predicted miR-34 binding sites in the 3’ UTR of the Eip74EF mRNA 
(Supplementary Fig. 4b). The Eip74EF gene is a component of steroid 
hormone signalling pathways. Although such pathways have generally 
been studied for effects during development, data have implicated 
these pathways in lifespan regulation”®. 

The Eip74EF gene encodes two major protein isoforms, E74A and 
E74B (referred to as the E74A and E74B genes, respectively''); the 
isoforms share the same 3’ UTR (Supplementary Fig. 4a). Northern 
blots indicated that transcription of E74A, but not E74B, persisted in 
adults, overlapping the time period when mir-34 is expressed 
(Supplementary Fig. 4c). Given this, we focused on E74A as a regulated 
target of miR-34 in the adult. Despite robust expression of the mRNA 
transcript, the E74A protein was expressed at low levels in adult heads 
throughout lifespan (Fig. 3a, b and Supplementary Fig. 4d). In flies 
lacking miR-34, E74A protein was markedly increased (Fig. 3b); E74A 
was also de-regulated in the logs”! mutant flies (Supplementary 
Fig. le). Genomic rescue of mir-34 mitigated this de-regulation of 
the E74A protein (Fig. 3c). Fine temporal analysis indicated that the 
E74A protein was highly expressed in young flies, but underwent a 
marked decrease within a 24-h time window (Supplementary Fig. 5). 
This temporal pattern seemed to be mutually exclusive to that of 
miR-34 (see Supplementary Fig. 2a). Moreover, in flies lacking 
miR-34, the downregulation of E74A protein during this critical period 
was dampened (Supplementary Fig. 5). This evidence indicates that 
adult-onset expression of mir-34 functions, at least in part, to attenuate 
E74<A protein expression in the young adult, and maintain that repres- 
sion through adulthood (Supplementary Fig. 4d). 

We next determined whether deregulated expression of E74A protein 
contributed to the age-associated defects in mir-34 mutants. Because 
E74A function is essential during development, with strong mutations 
leading to pre-adult lethality”, we used the mild, but viable, E74A3601805 
hypomorphic mutation (Supplementary Fig. 4a). When the E74A°0018 
mutation was combined with mir-34 mutant flies in the same 
homogenous genetic background, proper regulation of E74A protein 
was partially restored (Fig. 3d), and age-associated defects due to loss of 
mir-34, including shortened lifespan and brain vacuolization, were 
mitigated (Fig. 3e, f; E74A®6985 mutants alone have a normal lifespan 
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(Supplementary Fig. 6a)). To assess further the adult activity of E74A, 
we upregulated E74A in the adult with an E74A transgene that lacks 
miR-34 binding sites driven by a temperature-sensitive promoter’’. At 
29 °C, these flies demonstrated increased levels of E74A expression in 
the adult (Supplementary Fig. 6b). Notably, these animals also showed 
late-onset brain degeneration (Supplementary Fig. 6c) and a signifi- 
cantly shortened lifespan (Supplementary Fig. 6d). These data indicate 
that deregulated expression of E74A has a negative impact on normal 
ageing, and that one function of miR-34 is to silence E74A in the adult 
to prevent the adult-stage deleterious activity of E74A on brain 
integrity and viability. 

Notably, during the course of these studies, we noted that mir-34 
mutants also displayed a defect in protein misfolding—a molecular pro- 
cess implicated in ageing and common to many human neurodegenera- 
tive diseases'’. Whereas normally with age, the fly brain accumulates a 
low level of inclusions that immunostain for stress chaperones like 
Hsp70/Hsc70, mir-34 mutants showed a marked increase compared 
to control flies of matched age (30 days) (Supplementary Fig. 7). Given 
that mir-34 expression increases with age, and mir-34 loss shows altered 
chaperone accumulation, we tested whether mir-34 expression itself 
is upregulated by stresses like heat shock or oxidative toxins, but found 
no evidence to support this (data not shown). However, given that loss 
of mir-34 caused an increase in protein misfolding, this raised the 
possibility that upregulation of mir-34 expression might mitigate 
disease-associated protein misfolding. In Drosophila, expression of a 
pathogenic ataxin-3 polyglutamine (polyQ) disease protein (SCA3trQ78) 
leads to inclusion formation, a decrease in polyQ protein solubility and 
progressive neural loss'* (Supplementary Fig. 8a). Upregulation of 
mir-34 markedly mitigated polyQ degeneration, such that inclusion 
formation was slowed, the protein retained greater solubility, and neural 
degeneration was suppressed (Fig. 3g, h and Supplementary Fig. 8b-d). 
Lowering E74A expression by heterozygous reduction in flies expressing 
pathogenic polyQ protein revealed a minimal effect (data not shown), 
indicating that E74A may not be a target of miR-34 activity in this 
process. However, our studies with E74A were of necessity limited to 
hypomorphic alleles that may not uncover the full extent of E74A func- 
tion mediated by miR-34. Furthermore, additional targets of miR-34 
may be involved in different aspects of miR-34-directed pathways, 
including disease. 

Given this effect to mitigate disease-associated neural toxicity with 
upregulation of mir-34, and that mir-34 expression naturally increases 
with age, we investigated whether enhanced expression of mir-34 in 
wild-type flies could modulate the ageing process. We increased miR- 
34 dosage in wild-type flies with genomic rescue transgenes, which 
express mir-34 under its endogenous regulatory elements (see 
Supplementary Fig. 2a). Analysis of multiple independent transgenics 
in the same genetic background with that of control indicated that 
upregulation of miR-34 levels with genomic constructs (~20%, 
Supplementary Fig. 3d) promoted median survival rate by ~10% 
compared to wild type (Fig. 3i; other traits, such as the occurrence of 
brain vacuolization, despite being an age-associated phenomenon, are 
sporadic and low in normal flies, thus were difficult to assess). Thus, 
upregulation of mir-34 expression can protect from neurodegenerative 
disease and extend median lifespan. 

Our findings indicate that miR-34 in Drosophila presents a key 
miRNA that couples long-term maintenance of the brain with healthy 
ageing of the organism. miR-34 activity, enhanced by its age- 
modulated expression and processing, is critically involved in silencing 
of the E74A transcript through adulthood and in modulation of pro- 
tein homeostasis with age, as well as in polyQ disease. Select neural cell 
types may be especially vulnerable in ageing and disease’; miR-34 
function may have an impact on the integrity or activity of these 
systems. Intriguingly, E74A seems to confer sharply opposing function 
on animal fitness at different life stages, being essential during pre- 
adult development"', but harmful to the adult during ageing (this 
study). This biological property—of a gene being beneficial at one 


23 FEBRUARY 2012 | VOL 482 | NATURE | 521 


©2012 Macmillan Publishers Limited. All rights reserved 


LETTER 


a E74A mRNA b E74A protein 
3d 30d 3d 30d 
x nv” 
ee a & XK > af oO of 
Sow SF MF 


” att a e 


ea 


wit 


Relative fold change Relative fold change 


ee ee wee ulin 


1 Control 


E74A protein d E74A protein 
mir-34-- 
ys x7) xX 
eof * b é ae 
S NASR Ro 
Fe eS s&s onan 


103 kDa- _—oe 103 i one 


er Tubulin a ee ee Tubulin 
10 
5 — 
1 


sion fold change Relative fold change 


e sale g SCA3trQ78_ SCA3trQ78; mir-34(+) 
9 IB mir-347-;E74A8S/+ 
100 29 0C,P<0.0001  Bimirga-“e7aaPCe74a96 Dar! 

~~ S P<0.05 

x fo) 

Oo 1o) 

2 5 

oO c=] 
© 

n = Control .N 

2@ mir 34%) E74ABS/+ 5 

ic 5 

rane ji74Abe ce) = 
0) g ; 
0 10 20 30 40 50 Medulla Other regions 


Days after eclosion 
SCA3Q78(21d) i 


Control (21d) = mir-34 (+) 


Flies alive (%) 


6.99+0.08 2.4641.32 


6.90+0.34"" 


Figure 3 | The Drosophila Eip74EF gene is a target of miR-34 in modulation 
of the ageing process. a, E74A mRNA is robustly expressed in the adult and 
unchanged between age-matched controls and mir-34 mutants. In control flies, 
E74A mRNA is significantly upregulated in 30 day compared to 3 day animals. 
RNA was from male heads. Mean = s.e.m., n = 3 independent experiments; 
signal density of E74A mRNA normalized to 18S rRNA loading control 

(*P < 0.01, one-way analysis of variance, with post test: Tukey’s multiple 
comparison test). Genotypes: control, 5905; mir-34_‘~, mir-34 null-1 in 5905 
homogenous genetic background. b, E74A protein is deregulated in mir-34 
mutants. Protein was from male heads. Mean = s.e.m., n = 3 independent 
experiments; signal density of E74A protein normalized to tubulin loading 
control (*P < 0.01, one-way analysis of variance, with post test: Tukey’s 
multiple comparison). Genotypes as in a. c, Deregulation of E74A protein is 
diminished in mir-34 rescue flies. Protein was from male heads. Mean = s.e.m., 
n = 3 independent experiments; signal density normalized to tubulin loading 
control (*P < 0.05, one-way analysis of variance, with post test: Tukey’s 
multiple comparison test). Genotypes: control, 5905; mir-34 /~, mir-34 null-1 
in 5905 homogenous genetic background; mir-34—'~; mir-34( =. mir-34 
genomic rescue in mir-34 null-1 in 5905 homogenous genetic background. 

d, mir-34 mutants homozygous for the E74A®°"!8® allele have lower levels of 
E74A protein. Protein was from male heads of 20 day flies raised at 29 °C. 
Mean + s.e.m., n = 3 independent experiments; signal density normalized to 
tubulin loading control (*P < 0.01, one-way analysis of variance, with post test: 
Tukey’s multiple comparison test). Genotypes: control, 5905; mir-34 /~ 
E74A°°/4+, E74A2016/ 4. miR-34 null-1 in 5905 homogenous genetic 
background; mir-34/~ E74A°S/E74A°°, E74A% 0189/6744 O01, miR-34 
null-1 in 5905 homogenous genetic background. e, f, Reducing E74A protein 
levels in the adult mitigates age-related defects of mir-34 mutants. mir-34 
mutants also homozygous for E74A°°'%°° show rescued lifespan (e) and brain 
morphology (f), compared to mir-34 mutants heterozygous for E74A200!80° 
(these flies have a lifespan that is the same as mir-34 mutants alone; see 


life stage, but damaging at another—is referred to as antagonistic 
pleiotropy’®. Genes associated with antagonistic pleiotropy are likely 
to be evolutionarily retained due to their earlier beneficial function”. 
Their adult-onset activities, however, antagonize the ageing process if 
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Supplementary Table 4). Flies raised at 29 °C. Lifespan: P< 0.0001 (log-rank 
test). Mean = s.e.m., 1 = 150 male flies. Brain vacuoles: *P < 0.01 (one-way 
analysis of variance, with post test: Tukey’s multiple comparison test). 

Mean + s.e.m., m = 10 independent male animals. Genotypes as in 

d. g, Upregulation of mir-34 reduces accumulation of pathogenic polyQ protein 
inclusions. Left panels: in the retina of flies expressing SCA3trQ78 alone, 
pathogenic polyQ protein is initially diffuse (1 day, top), but gradually 
accumulates into nuclear inclusions (3 day, bottom). Right panels: upregulation 
of mir-34 reduces inclusion formation. DAPI staining highlights nuclei. 3 day 
controls show 53.75 + 12.55 inclusions in a retinal section versus 23.67 + 7.57 
with mir-34 upregulation; mean + s.d., n = 3 cryosections from independent 
male animals; P < 0.01 (t-test). Genotypes: SCA3trQ78 is w';rh1-GAL4, UAS- 
SCA3trQ78/+. SCA3trQ78; mir-34 (+) is w*; rh1-GAL4, SCA3trQ78/+; UAS- 
mir-34/+. Scale bar, 0.05 mm. h, Upregulation of mir-34 prevents neural 
degeneration. At 21 days, male flies expressing SCA3trQ78 show a marked loss 
of photoreceptor neuronal integrity (middle panel), with an average of only 
2.46 + 1.32 photoreceptors per ommatidium remaining by pseudopupil 
analysis. Flies with upregulated mir-34 (right panel) retain 6.90 + 0.34 
photoreceptors per ommatidium. Control (left panel) and upregulation of mir- 
34 alone (not shown) have normal photoreceptor numbers per ommatidium. 
Mean = s.d., n = 619, 722 and 700 ommatidia, for SCA3trQ78, SCA3trQ78; 
mir-34 (+) and control, respectively; ***P < 0.0001 (one-way analysis of 
variance, with Bonferroni’s multiple comparison test). Genotypes as in 

b; control: w*; rh1-GAL4/+. Scale bar, 0.05 mm. i, Flies with upregulated mir- 
34 (colour) have an extended median lifespan compared to control flies (black 
and grey curves for repeats 1 and 2, respectively) (log-rank test). Lifespan result 
for each genotype is indicated in median and maximal days. Mean + s.e.m., 
n= 150 male flies per genotype, 25 °C. Three independent mir-34 genomic 
transgenic lines (4, 8, 9) were analysed. Genotypes: control, 5905; mir-34 (+), 
mir-34 genomic rescue in 5905 homogenous genetic background. 


they are not properly regulated. miRNA pathways provide a 
tantalizing mechanism by which to suppress potentially deleterious 
age-related activities of such genes; a number of miRNAs have been 
noted to show age-modulated expression and activity'*’. Roles of 
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select miRNAs normally expressed in the adult may be of evolutionary 
advantage to tune-down events that promote age-associated decline 
and potentially disease, in order to prolong healthy lifespan and 
longevity. Upregulation of lin-4, a C. elegans miRNA with a known 
developmental role, extends nematode lifespan”, raising the possibility 
that this upregulation, like the natural increase of mir-34 expression in 
Drosophila, functions to silence genes that have a negative impact on 
ageing and potentially promote disease. Notably, mir-34 expression is 
elevated with age in C. elegans’””°, and mammalian mir-34 orthologues 
are highly expressed in the adult brain’’ and have also been noted to 
increase with age and be misregulated in degenerative disease in 
humans””®*. Current data regarding miR-34 function indicate that it 
is neutral or adverse in C. elegans’*”’, and can be either protective 
or contributory to age-associated events in vertebrates *°. Thus, 
miR-34 seems to be a key miRNA poised to integrate age-associated 
physiology; the precise function will reflect the diverse spatiotemporal 
expression and activity of distinct orthologues, the mRNA target 
spectrum, as well as the complexity of the adult brain and life cycle. 
The conservation of miR-34, coupled with in-depth comparative 
analysis of mir-34 expression, 3' end processing, targets and pathways 
in the ageing process of nematodes, flies and mammals, make it a 
tempting subject for understanding features of ageing and disease 
susceptibility. 


METHODS SUMMARY 


Flies were grown in standard media at 25 °C unless otherwise specified. Stock lines 
and GAL4 driver lines were obtained from the Drosophila Stock centre at 
Bloomington, or are described‘. Deletion of the mir-34 region was made by site- 
specific recombination. Fly transgenics were generated by standard procedures. 
Flies were generated or backcrossed a minimum of five generations into a controlled 
uniform homogeneous genetic background (line 5905 (FlyBase ID FBst0005905, 
w'7'8)), to assure that all phenotypes were robust and not associated with variation 
in genetic background. In this uniform homogeneous genetic background, the 
lifespan of control flies is highly uniform with repetition when 150 or more indi- 
viduals are used for lifespan analysis. Negative geotaxis and thermo stress were used 
to examine fly locomotion and stress resistance, respectively. Adult male heads were 
processed for paraffin sections as described'*. To determine lifespan, newly eclosed 
males were collected and maintained at 15 flies per vial, transferred to fresh vials 
every 2 days while scored for survival. A total of 150-200 flies were used per 
genotype per lifespan; all experiments were repeated multiple times (see 
Supplementary Table 4). Lifespans were analysed in Excel (Microsoft) and by 
Prism software (GraphPad) for survival curves and statistics. Techniques of 
molecular biology, western immunoblots and histology were standard. Fly brain 
mRNA was prepared using Trizol reagent for array and mRNA analysis, miRNA 
arrays were miRCURY LNA arrays version 8.1 (Exiqon), and mRNA expression 
was profiled using Affymetrix Drosophila 2.0 chips (Affymetrix). The microarray 
data can be found in the Gene Expression Omnibus (GEO) of NCBI through 
accession number GSE25009. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 


Genetic background. Fly lines were from the Bloomington Stock centre or are 
described'*. To control for background effects, and to assess significance of all 
effects, flies were generated in the same uniform homogeneous genetic background 
(line 5905 (FlyBase ID FBst0005905, w'!'8)). or backcrossed a minimum of five 
generations into this uniform genetic background. This assured that, for all 
phenotypes, even modest and consistent effects were associated with the gene 
manipulations and not a variation in background. With these carefully controlled 
experiments, the lifespan of control flies was highly uniform upon repetition, 
when 150 or more individuals were used for lifespan analysis (see Supplemen- 
tary Table 4). 

mir-34 deletion mutants. Deletion of the mir-34 region was made by site-specific 
recombination between two piggyBac insertions, using FLP-FRT-mediated site- 
specific recombination’*. The loss of other genes in the region was then fully 
rescued by genomic transgenes, so that a line selectively lacking only mir-34 was 
generated. Two FRT-bearing insertions, PBac[XP]d02752 and PBac[RB]Fmr1°°?””, 
were used (Exelixis collection), which encompass the mir-34 region. Genetic crosses 
were made to combine these two transposon elements with heat-inducible FLP 
recombinase. After 48h of egg laying, parents were removed, and vials containing 
progeny were placed in a 37 °C water bath for a 1-h heat shock. Progeny flies were 
treated with daily 1-h heat shock, for an additional 4 days. Young virgin female 
progeny flies were collected and crossed to males with 3rd chromosome balancers. 
In the subsequent generation, progeny males were used to generate additional 
progeny for PCR confirmation. Progeny flies bearing the deletion were positive 
for PCR verification, using primers from neighbouring genomic DNA and ones 
from transposons (upstream insertion: 5’-GGTCGTGCATGACGAGATTA-3'/ 
5’-TACTATTCCTTTCACTCGCACTTATTG-3’; downstream insertion: 5’-TC 
CAAGCGGCGACTGAGATG-3'/5'-GTGCGTTCGAAGAAATGATG-3’). Flies 
with the mir-34 region deletion were viable, and were further verified for the appro- 
priate deletion by PCR amplification, with primers for mir-317 (5'-CGGAAA 
AACGGTTITGTGTCT-3'/5'-CCCGGGAACGAGTAAACGAAATGAAAATCA-3’), 
mir-277 (5'- TGATTTATGGTTTTTGTTTCAGTTG-3'/5'-TTGATATCATT 
TCACACTATCACAAAAATTGC-3'), mir-34 (5'-ACCTTGAGCGCTTCAAC 
TCT-3'/5'-CACTCTTTCTCGTTTGCATGG-3’) and dfmr1 (5'-CACACAGA 
GCTTCCCAGTGA-3'/5’-AGGCCCTCCTTTTTGACATT-3’). 

Fly age-associated phenotypes. Negative geotaxis and thermo stress were used to 
examine fly locomotion and stress resistance, respectively. To perform negative 
geotaxis, groups of 15 adult male flies of indicated age were transferred into a 
14-ml polystyrene round-bottom tube (Falcon), and placed in the dark for 30-min 
recovery. The assay was conducted in the dark, with only a red light on. Climbing 
ability was scored as the percentage of flies failing to climb higher than 1.5 cm from 
the bottom of the tube, within 15 s after gently being banged to the bottom. Three 
repeats were performed for each group and the result averaged. For each genotype 
at a given age, a minimum of 200 flies were tested. For heat sensitivity, groups of 15 
adult males of indicated age were transferred into 14-ml polystyrene round- 
bottom tubes (Falcon) then placed in a 25°C incubator for 30 min recovery. 
Heat stress was applied by immersing the vial containing the flies into a 37°C 
water bath for 1h, followed by a 30-min recovery at 25 °C, then another 1-h heat 
stress at 37 °C. Flies were then transferred into regular food vials and maintained at 
25 °C. Dead flies were counted after 24h. To assess brain morphology, adult male 
heads were processed for paraffin sections as described”, and brain vacuoles were 
counted through continuous sections generated from each head (n = 10 heads 
counted for each genotype). 

Molecular biology. Fly genomic DNA was prepared from whole flies with the 
Puregene DNA purification kit (Qiagen). To generate mir-34 pUAST constructs, 
PCR amplification was conducted using genomic DNA as template, with primer 
pairs of pUAST mir-34-I (286bp, PCR primer 5'-CCGTTACACACGACT 
ATTCTCAAT-3'/5'-CCATCTGATACAGGTCCTACATTTTCTAAAA-3’) and 
pUAST mir-34-II (936 bp, PCR primer 5’-ACCTTGAGCGCTTCAACTCT-3'/ 
5'-CACTCTTTCTCGTTTGCATGG-3’). PCR products were then ligated into 
the pUAST vector. mir-277/dfmr1 rescue construct was made in the pCaSpeR4 
vector, which contained two parts. Part 1 was a genomic DNA fragment (7,530 bp) 
harbouring the mir-277 sequence (PCR primers: 5’-GGTCGTGCATGACGAG 
ATTA-3'/5'-GGATGTTTTGCGACCAACTT-3’), and part 2 was a genomic 
fragment containing dfmr1 genomic sequence, derived from the pBS WTR con- 
struct (a gift from T. Jongens*’), by BamH1 and Ppuml. The mir-34 genomic 
rescue construct was also made in the pCaSpeR4 vector, with two parts. One 
was a genomic DNA fragment (6,855 bp) upstream of mir-277 sequence (PCR 
primers: 5'-GGTCGTGCATGACGAGATTA-3'/5'-GGATGCATTTTATCGTT 
AGGC-3’), and the other was a genomic DNA fragment (2,111 bp) containing 
mir-34 sequence (PCR primers: 5’-GCAGGAAAATGCGATAAATGA-3'/ 
5’-TCGTTACAACATGGAAATCCTC-3’). The resultant construct, therefore, 
contains mir-34 sequence, including most upstream fragment, with the exclusion 


of 108 bp of mir-277 sequence. In addition, a modified mir-34 genomic rescue 
construct was made (pCaSpeR4 vector), which contains same upstream and down- 
stream ends of the original mir-34 genomic rescue construct, with a small deletion 
of miR-277 mature sequence. The genomic regulation of mir-34 seems complex, as 
despite these standard manipulations for gene rescue, the genomic rescue expres- 
sion of mir-34 and extent of phenotypic rescue of mir-34 mutants was only partial. 
We attempted upregulation of mir-34 with the GAL4-UAS system, including with 
the conditional gene switch system in adults. Upregulation of mir-34 during 
development in non-germline tissues (when it normally is not expressed; Sup- 
plementary Fig. 2a) was deleterious, and we were unable to upregulate mir-34 
expression more robustly than with the genomic constructs. 

For western immunoblots, 10 adult male heads per sample were homogenized 
in 50 pl of Laemmli buffer (Bio-Rad) supplemented with 5% 2-mercaptoethanol, 
heated to 95 °C for 5 min and 10 ul loaded onto 4-12% Bis-Tris gels (NuPage), 
then transferred to nitrocellulose membrane (Biorad) and blotted by standard 
protocols. Primary antibodies used were anti-tubulin (1:10,000, E7, Develop- 
mental Studies Hybridoma Bank), anti-E74A (a gift of C. Thummel). Secondary 
antibodies for immunoblots were goat anti-mouse conjugated to HRP (1:2,000, 
Chemicon) and developed by chemiluminescence (ECL, Amersham). The final 
image was obtained by Fuji scanner (Fujifilm). 

Total RNA was isolated from 50-200 male heads per genotype, by cutting off 
heads with a sharp razor, then putting heads into Trizol reagent. Heads were 
ground by pestle, then RNA was isolated following the manufacturer’s protocol 
(Trizol reagent, Invitrogen). 5 ug RNA was used per lane. Gel running (1% agarose) 
and blot transfer (nylon plus) were according to recommended procedures 
(Northernmax, Ambion). The RNA blot was then used for hybridization following 
standard procedures at 68 °C, with pre-hybridization (~1 h), hybridization (~12h 
or overnight) with P**-labelled probe, washed and exposed to Phosphoimager 
(Amersham). RNA probes were used that were made by in vitro transcription of 
cDNA templates using Maxiscript-T7 in vitro transcription kit (Ambion), supple- 
mented with P**-labelled UTP. The cDNA templates were prepared from total 
RNA by one-step RT-PCR (SuperScript One-Step RT-PCR with Platinum Taq, 
Invitrogen), with primers: E74A (5'-GTGAACGTGGTGGTGGAAC-3'/ 
5'-GATAATACGACTCACTATAGGGAGATGTCCATTCGCTTCTCAATG-3’); 
E74B_ (5'-CATCGCTTGTCAATGTGTCC-3'/5'-GATAATACGACTCACTA 
TAGGGAGACTGCGGTAATCACTGAGCTG-3’);18S rRNA loading control 
(5'-GATAATACGACTCACTATAGGGAGA-3'/5'-AGGGAGCCTGAGAAAC 
GGCTACCACATCTAAGGAATCTCCCTATAGTGAGTCGTATTATC-3’). 

For small RNA northern blots, total RNA was isolated from male fly heads using 
Trizol reagent as above. For each lane, 3 tg of RNA was used, and RNA was 
fractionated on a 15% Tris-UREA gel (NuPage) with 1x TBE buffer. The blot 
transfer was performed with 0.5xTBE buffer. Before hybridization, the RNA blots 
were pre-hybridized with Oligohyb (Ambion), and then incubated with radioactive 
labelled RNA probes for ~12h to overnight. RNA probes were used, and made by 
in vitro transcription of oligo templates using Maxiscript-T7 in vitro transcription 
kit (Ambion), supplemented with P*? -labelled UTP. Oligo DNA templates were 
prepared by annealing two single-stranded DNA oligonucleotides into duplex 
(99°C, 5min and cool down to room temperature). Oligonucleotides used for 
mir-34 (5'-GATAATACGACTCACTATAGGGAGA-3'/5'-AAAAAATGGCA 
GTGTGGTTAGCTGGTTGTGTCTCCCTATAGTGAGTCGTATTATC-3’), mir- 
277 (5'-GATAATACGACTCACTATAGGGAGA-3'/5'-TAAATGCACTATCTG 
GTACGACATAAATGCACTATCTGGTACGACA TCTCCCTATAGTGAGT 
CGTATTATC-3’) and 2S rRNA (5'-GATAATACGACTCACTATAGGGA 
GA-3'/5'-TGCTTGGACTACATATGGTTGAGGGTTGTATCTCCCTATAGT 
GAGTCGTATTATC-3’). 

Luciferase assays were performed using standard approaches’. Specifically, 
8 X 10* DL1 cells were plated and bathed in 30 ul of serum-free medium with 
60 ng of dsRNA in each well of a 96-well plate. The next day, 1.6 ng of pMT-Firefly, 
400 ng of pMT-mir-34 and 400ng of pMT-renilla E74A wild-type or mutant 
3’ UTR reporters were transfected by Effectene (Qiagen). Two days after transfec- 
tion, the expression of the reporters and mir-34 was induced by CuSO,. Twenty- 
four hours after induction, luminescence assays were performed by the Dual-Glo 
Luciferase Assay System (Promega). The mir-34 seed sequences in the 3’ UTR of 
E74A were mutated as noted in Supplementary Fig. 3, using the Quik change 
mutagenesis system (Stratagene). Primers to knockdown Agol are described’. 

The miRNA-target prediction algorithms TargetScan (v5.1)*! and PicTar (fly)? 
were used to determine miR-34 target mRNA candidates. 
miRNA microarray analysis. For miRNA array analysis, Iso31 flies (isogenized 
w'T!8) were used. Flies were killed by brief submersion in ethanol under CO, 
anaesthesia, followed by two PBS washes (Sigma). To control for circadian effects, 
all flies were processed between 11:00 and 13:00. Brains were removed manually 
and collected in an Eppendorf microcentrifuge tube stored on ice. For each 
miRNA microarray replicate, 200-300 brains were collected for each time point, 
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with a ~50/50 ratio of males and females. RNA was prepared using the miRvana 
RNA extraction system (Ambion) yielding ~2.5 ug per 100 brains. RNA was 
eluted into 80 pl of RNase free water (Fisher Scientific) and stored at —80°C. 
miRNA profiling was carried out at the Penn microarray core facility using 
miRCURY LNA arrays (Exiqon) and protocols. Exiqon’s Hy3/H5-labelling kit 
was used (Exiqon). RNA samples were labelled with Hy3 and hybridized together 
with a Hy5-labelled common reference standard. The common reference standard 
consisted of equal amounts of RNA from brains of 3 days, 30 days and 60 days flies. 
The miRNA microarray data were analysed at the Penn Bioinformatics Core. Raw 
data was imported into Gene Spring 1.0 (Agilent) and normalized using a global 
LOESS regression algorithm (locally weighted scatterplot smoothing). Relative 
expression levels were calculated as the log, normalized signal intensity difference 
between the Hy3 and Hy5 intensity. Present/absent flagging was analysed by 
Exiqon (Exiqon). Expression levels (fold changes) for the 30 day and 60 day time 
point were calculated relative to the 3 day time point. The data sets were exported 
into Spotfire DecisionSite 9.0 (Tibco) for visualization and filtering. 

mRNA microarray analysis. For ageing microarray analysis, fly stock Iso31 was 
used. For mir-34 mutant microarray analysis, mir-34 null line-1 in 5905 back- 
ground was used, with fly 5905 line, as control. To generate an ageing profile, flies 
were aged to 3 days, 30 days and 60 days, and 30-50 brains dissected per time 
point, per replicate, as above (50-50 males and females). For each time point, three 
replicates were conducted. For mir-34 mutant microarray analysis, time points 
were 3 days and 20 days, and for each time point, 20 brains from male flies of the 
appropriate genotype were used, with five replicates in total. Microarray hybrid- 
ization and reading was performed at the Penn Microarray Core Facility. For 
mRNA microarrays, total RNA was reverse transcribed to ss-cDNA, followed 
by two PCR cycles using the Ovation RNA amplification system V2 (Ovation). 
Quality control on both RNA and ss-cDNA was performed using an 2100 Agilent 
Bioanalyzer (Quantum Analytics). The cDNA was labelled using the FL-Ovation 
cDNA Biotin Module V2 (Ovation), hybridized to Affymetrix Drosophila 2.0 chips 
(Affymetrix) and scanned with an Axon Instruments 4000B Scanner using 
GenePix Pro 6.0 image acquisition software (Molecular Devices). Affymetrix 
.cel (probe intensity) files were exported from GeneChip Operating Software 
(Affymetrix). The .cel files were imported to ArrayAssist Lite (Agilent) in which 
GCRMA probe-set expression levels and Affymetrix absent/present/marginal 
flags were calculated. Statistical analysis for those genes passing the flag filter 
was performed using Partek Genomics Suite (Partek). The signal values were 
log, transformed and a 2-way ANOVA was performed. 
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Transcriptional analysis of ageing status. We first used the wild type to extract 
age-associated probe sets and then compared the relative changes of these probe sets 
in a separate set of transcriptional profiles generated for the wild type and mir-34 
mutant. For transcriptional profiles of normal aged brains, the GCRMA package 
RMA (J. Z. Wu, J. MacDonald and J. Gentry, GCRMA: background adjustment 
using sequence information, R package version 2.14) for R/Bioconductor*’ was used 
to generate log, expression levels for probe-set IDs from the original .cel files. Then, 
a linear regression model was used to compute the significance of a correlation 
between age and gene expression’. This approach assumes a linear relationship 
between age and log, expression level: 


Yj = ut By Ajtai 


In this equation, Yj is the log, gene expression level of probe set i in sample j, A; is 
the age for individual j. The coefficients /,; is regression coefficients reflecting the 
rate of change in gene expression with respect to age. Probe sets with expression 
significantly correlated with age (P= 0.001 for /;) were determined. Then the 
same probe sets were used to estimate the relative expression in separate profiles of 
mir-34 mutants and age-matched controls. The average levels of each individual 
probe set were calculated for the difference between 20 day and 3 day, within the 
same genotype (that is, A20 day/A3 day) for each gene in controls and mir-34 
mutants, respectively. These differences were then compared between genotypes 
(that is, mir-34 mutants — controls). The significance of the difference between 
genotypes was analysed using a paired Wilcoxon test. The difference between 
control and mutant samples in positively correlated genes (Fig. 2d) is not by 
chance (P = 0.0001). 
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Maintenance of muscle stem-cell quiescence by 


microRNA-489 


Tom H. Cheung’, Navaline L. Quach'?, Gregory W. Charville!?’, Ling Liu'?, Lidia Park”, Abdolhossein Edalati’”, Bryan Yoo”, 


Phuong Hoang!” & Thomas A. Rando’*"*> 


Among the key properties that distinguish adult mammalian stem 
cells from their more differentiated progeny is the ability of stem 
cells to remain in a quiescent state for prolonged periods of time’”. 
However, the molecular pathways for the maintenance of stem-cell 
quiescence remain elusive. Here we use adult mouse muscle stem 
cells (satellite cells) as a model system and show that the microRNA 
(miRNA) pathway is essential for the maintenance of the quiescent 
state. Satellite cells that lack a functional miRNA pathway sponta- 
neously exit quiescence and enter the cell cycle. We identified 
quiescence-specific miRNAs in the satellite-cell lineage by micro- 
array analysis. Among these, miRNA-489 (miR-489) is highly 
expressed in quiescent satellite cells and is quickly downregulated 
during satellite-cell activation. Further analysis revealed that 
miR-489 functions as a regulator of satellite-cell quiescence, as it 
post-transcriptionally suppresses the oncogene Dek, the protein 
product of which localizes to the more differentiated daughter cell 
during asymmetric division of satellite cells and promotes the 
transient proliferative expansion of myogenic progenitors. Our 
results provide evidence of the miRNA pathway in general, 
and of a specific miRNA, miR-489, in actively maintaining the 
quiescent state of an adult stem-cell population. 

The miRNA pathway has been shown to be essential for stem-cell 
pluripotency, proliferation and differentiation**. To understand whether 
adult quiescent stem cells are under active post-transcriptional control 
by miRNAs, we conditionally ablated the miRNA processing enzyme 
Dicer in adult muscle stem cells, or satellite cells, using a mouse strain 
that expresses a satellite-cell-specific, tamoxifen-inducible Cre/loxP 
system? (Supplementary Fig. 1) and is homozygous for a floxed 
Dicer allele’ and a Cre-dependent yellow fluorescent protein (YFP) 
reporter’. Six days after the first tamoxifen injection to this conditional 
knockout strain, Dicer protein and miRNA levels were significantly 
downregulated in YFP-positive satellite cells (P< 0.001; Supplemen- 
tary Figs 2 and 3). Notably, in conditional knockout mice we detected 
YFP-positive satellite cells that had spontaneously exited quiescence 
and entered the cell cycle (Fig. 1a, b). In control mice, less than 1% of 
YFP-positive satellite cells were Ki67-positive at this time. These obser- 
vations suggest that an intact miRNA pathway is essential for the 
maintenance of satellite-cell quiescence. Deletion of Dicer also led to 
apoptosis of proliferating satellite-cell progeny (Fig. 1c, d and Sup- 
plementary Fig. 4). Together, these experiments demonstrate the 
essential role of miRNAs in the maintenance of satellite-cell quiescence 
and in the survival of proliferating myogenic progenitors. 

To assess the impact of miRNA pathway disruption on satellite-cell 
homeostasis, we quantified the number of satellite cells using single- 
fibre explants and mononuclear cells that were isolated from uninjured 
muscles of conditional knockout mice 2 weeks after tamoxifen injec- 
tions. We observed a marked reduction in satellite-cell number in the 
absence of Dicer (Fig. le, f). To confirm the functional loss of satellite 


cells, hindlimb muscles of tamoxifen-injected conditional knockout 
mice were injured to induce satellite-cell-mediated regeneration. 
Seven days after injury, very few regenerated fibres were observed in 
the conditional knockout mice, indicating severely impaired regenera- 
tion (Fig. 1g). Further analysis 6 months after injury revealed a marked 
reduction in the mass of injured muscles compared to the contra- 
lateral, uninjured muscles. By comparison, control mice exhibited a 
hypertrophic response after muscle injury (Fig. 1h). Consistent with 
the finding that adult muscle satellite cells have a low turnover rate®, 
uninjured muscle appeared in general to be normal 6 months after 
disruption of the Dicer gene (Supplementary Fig. 5a). However, the 
loss of satellite cells resulted in mild muscle-fibre atrophy in con- 
ditional knockout animals over time (Supplementary Fig. 5b). 

As the disruption of Dicer caused satellite cells to break quiescence 
and enter the cell cycle, we were interested in defining the role of 
specific miRNAs in maintaining the quiescent state. Quantitative 
real-time polymerase-chain-reaction (qRT-PCR)-based miRNA 
microarray analysis of highly purified quiescent satellite cells (QSCs) 
and activated satellite cells (ASCs) (Supplementary Fig. 6) revealed that 
351 miRNAs were differentially regulated during satellite-cell activa- 
tion (Supplementary Table 1). Of these, 22 were highly expressed in the 
quiescent state and markedly downregulated after satellite-cell activa- 
tion (Fig. 2a). Among the 22 quiescence-specific miRNAs, we focused 
on miR-489 because it is evolutionarily conserved among species’ and 
because it resides in intron 4 of the gene encoding calcitonin receptor 
(the Ctr gene; also known as Calcr) (Supplementary Fig. 7a), which is 
highly expressed in QSCs (Supplementary Fig. 7b, c) and has previ- 
ously been shown to regulate satellite-cell quiescence’®. Previous 
reports have suggested that intronic miRNAs co-express with host 
genes to co-regulate similar pathways''. The quiescence-specific 
expression of miR-489 and CTR was verified by qRT-PCR analysis 
(Fig. 2b, c). To determine whether miR-489 is specifically expressed in 
QSCs, we performed qRT-PCR analysis of isolated satellite cells and 
other mononuclear cell populations from uninjured muscle. As 
expected from the expression pattern of CTR (Supplementary Fig. 7c), 
miR-489 was highly enriched in QSCs relative to multinucleate muscle 
fibres or other mononuclear cells in the muscle (Fig. 2d, e). 

To test whether a sustained expression of miR-489 could lead to an 
impairment of muscle regeneration by suppressing satellite-cell activa- 
tion, an miR-489 expression plasmid was electroporated into hindlimb 
muscles in vivo. RT-PCR analysis revealed a high level of miR-489 
expression in tibialis anterior muscles electroporated with miR-489 
plasmid compared with the level in controls (Supplementary Fig. 8b). 
Six days after electroporation, control muscles exhibited normal 
regeneration, whereas muscles expressing miR-489 exhibited a severe 
defect in regeneration (Fig. 3a and Supplementary Fig. 8a). 

To test the hypothesis that overexpression of miR-489 suppresses 
muscle regeneration by maintaining satellite-cell quiescence and 
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Figure 1 | The miRNA pathway is essential for the maintenance of satellite- 
cell quiescence and survival of activated satellite cells. a, The tamoxifen 
(Tmx) injection scheme (black arrows) for conditional Dicer gene inactivation 
is shown (top). Each tick represents 1 day. Six days after the first injection, 
Ki67 YFP double-positive satellite cells were found (white arrows). Nuclei were 
stained with DAPI. b, Quantification of the percentage of YFP-positive cells 
that were Ki67-positive (Ki67*) in control and conditional knockout strain 
(cKO) mice (***P < 0.001). c, Six days after the first tamoxifen injection, 
muscles from control or cKO mice were analysed for apoptosis by staining for 
cleaved caspase 3. Nuclei were stained with DAPI. Inset in a and c, magnified 
view of the satellite cells in the full-size images. d, Quantification of the 
percentage of YFP-positive cells that were caspase-3-positive (Casp3") in 
control and cKO animals (***P < 0.001). e, Satellite-cell numbers were 
quantified on freshly isolated single fibres from control and cKO mice 2 weeks 
after tamoxifen injections (*P < 0.001). f, Satellite-cell numbers were 
quantified by FACS analyses of mononuclear cells from hindlimb muscles of 
control and cKO mice. Satellite cells are shown in orange in these representative 
FACS plots (See Supplementary Fig. 4). In four replicates, the percentage of 
satellite cells in total mononuclear cells in CKO muscles was markedly reduced 
(0.7%) compared to that in control muscles (3.0%). Blue, all other mononuclear 
cells. g, Tibialis anterior muscles from control or cKO mice were injured 

2 weeks after tamoxifen injection and cryosections were stained with 
haematoxylin and eosin 7 days after injury. h, Tibialis anterior muscles from 
control or cKO mice were injured 2 weeks after tamoxifen injection and 
collected 6 months after injury. Severe muscle loss was observed in injured 
muscles from cKO mice only (shown next to the contralateral, uninjured 
muscle for comparison). Error bars in b and d indicate s.e.m. 


suppressing activation, we overexpressed miR-489 or anti-miR-489 in 
fibre-associated QSCs ex vivo. Using syndecan 4 as a satellite-cell marker 
on fibre explants'*’*, we quantified the number of satellite cells on 
fibres 3 days after transfection. Satellite cells treated with anti-miR- 
489 exhibited similar proliferative activity as control satellite cells, 
whereas satellite cells treated with miR-489 exhibited markedly reduced 
proliferation (and no evidence of apoptosis) (Fig. 3b). Furthermore, 
fewer than 50% of the cells treated with miR-489 progressed through 
a single round of cell division over the course of the experiment as 
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Figure 2 | miRNA expression in purified QSCs and ASCs. a, miRNA 
expression profiling during satellite-cell activation using gRT-PCR-based 
miRNA arrays. QSCs from uninjured muscles and ASCs from injured muscles 
at indicated time points were isolated by FACS (Supplementary Fig. 6). QSC- 
specific mouse miRNAs are shown. The complete data set is shown in 
Supplementary Table 1. b, RT-PCR analysis of miR-489 transcript in QSCs 
and ASCs. Expression levels were normalized to snoRNA420. ***P < 0.001. 
c, RT-PCR analysis of CTR, Pax7 and myogenin mRNA. Expression levels 
were normalized to glyceraldehyde-3-phosphate dehydrogenase (GAPDH). 
* P< 0.001; **P < 0.01. d, RT-PCR analysis of miR-489 transcript in QSCs 
and all other mononuclear cells in hindlimb muscles. Expression levels were 
normalized to snoRNA420. *P < 0.05. e, RT-PCR analysis of miR-206 

and miR-489 transcript in QSCs and single-fibre explants. Expression levels 
were normalized to snoRNA420. ***P < 0.001; **P < 0.01. All error bars 
indicate s.e.m. 


determined by 5-ethynyl-2'-deoxyuridine (EdU) labelling (Fig. 3c). 
To test whether miR-489 regulates satellite-cell quiescence in a cell- 
autonomous manner, we used myogenic differentiation 1 (Myod; also 
known as Myod1) expression as an indicator of satellite-cell activa- 
tion'* and quantified the percentage of satellite cells expressing Myod 
48 h after miR-489 transfection. Consistent with the fibre-explant 
experiment, miR-489 suppressed satellite-cell activation (Fig. 3d). 
Together, these experiments demonstrate that miR-489 regulates 
satellite-cell quiescence in a cell-autonomous manner and that over- 
expression of a single miRNA is sufficient to prolong the quiescent 
state and delay QSC activation, resulting in an impairment of regen- 
eration in vivo. 

Next, we tested whether inhibition of miR-489 could result in the 
spontaneous activation of QSCs, which rarely divide in the absence of 
any activating stimuli®. Cholesterol-conjugated ‘antagomirs’ (ref. 15) 
that specifically target miR-489, or control scrambled antagomirs, 
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Figure 3 | miRNA-489 regulates satellite-cell quiescence. a, Hindlimb 
muscles were electroporated with either miR-489 expression plasmid (right) ora 
control miR-489 mutant plasmid (left). Muscles were collected 6 days later and 
haematoxylin and eosin staining was performed on cryosections. A 
representative image of three independent replicates is shown. b, miR-489 or 
anti-miR-489 was overexpressed in fibre-associated satellite cells. Three days 
after transfection, the number of syndecan-4-positive (Syn4 *) satellite-cell 
progeny was quantified. ***P < 0.001; NS, not significant. c, In studies such as 
those in b, EdU was added to the medium at the time of miR-489 (or control) 
transfection and the percentage of Syn4” cells that were EdU-negative (EdU_ ) 
was determined after 3 days. **P < 0.01. d, Left, FACS-sorted QSCs from 


were delivered systemically to adult mice. Four days after a single 
antagomir injection, miR-489 transcript levels decreased precipitously 
(Supplementary Fig. 9). In contrast to the control mice, which were 
injected with scrambled antagomirs, mice injected with anti-miR-489 
antagomirs exhibited spontaneous activation of QSCs that incorpo- 
rated EdU (Fig. 3e). Notably, inhibition of one quiescence-specific 
miRNA, miR-489, was sufficient to induce QSCs to break quiescence 
and progress through the cell cycle in uninjured muscle. 

The observation that inhibition of miR-489 induced satellite-cell 
activation and proliferation prompted us to test whether miR-489 func- 
tions to suppress one or more key regulators of proliferation, thereby 
maintaining the quiescent state. We used the bioinformatics tool 
TargetScan to search for miR-489 target genes that contain putative 


526 | NATURE | VOL 482 | 23 FEBRUARY 2012 


miR-489 


Scramble miR-489 


Myod YFP DAPI 


Scramble miR-489 
S 
3 KK 
Pax7 EdU DAPI 5 807 -——, 
o 
5 60 e®: ead 
a ee i 
2 
9 40 "e ee 
to) 
5 . ca 
= 20 
xe} 
SG (07 tree: T cae T 
3 - o a a 
8 3 8 
a 
Ee 2g & 2 
§ E § — 
% 5 ° iS) 
sg ivi 
= e 
<x <x 


Pax77E®’*+, ROSAS*P!* mice were plated and transfected with miR-489 and 
analysed for Myod expression 48 h later. Right, quantification of the percentage 
of YFP-positive cells that were Myod-negative (Myod_). Nuclei were stained 
with DAPI. *P < 0.05. e, Left, satellite-cell activation in vivo, as determined by 
EdU incorporation, was assessed in muscles in which miR-489 was inhibited by 
the systemic injection of a cholesterol-conjugated anti-miR-489 oligonucleotide 
(antagomir-489) or a scrambled antagomir (scramble). Pax7 EdU double- 
positive (arrows) and Pax7-positive cells (arrowheads) are highlighted. Right, 
quantitation of the number of EdU-positive (EdU") cells on cryosections. Two 
representative replicates of four independent experiments are shown (nuclei 
were stained with DAPI). ***P < 0.001. All error bars indicate s.e.m. 


miR-489 target sites in their 3’ untranslated regions (3’ UTRs)’. 
Among the 86 targets predicted by TargetScan, the transcript with 
the highest context score’® was the oncogene Dek (Supplementary 
Fig. 10), which has been shown to be induced in tumour cells and to 
regulate cell proliferation and messenger RNA splicing’”’*. We 
analysed the temporal expression of Dek mRNA and protein during 
satellite-cell activation. Using paired box protein 7 (Pax7) as a marker 
of QSCs and Myod as a marker of ASCs’*”®, we found that Dek protein 
was not expressed in QSCs but was strongly upregulated after satellite- 
cell activation both in fibre-explant studies ex vivo and in regeneration 
studies in vivo (Fig. 4a and Supplementary Fig. 1la—c). Likewise, Dek 
mRNA levels were higher in ASCs compared to QSCs (Supplementary 
Fig. 11d). 
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Figure 4 | Targeting of Dek mRNA by miR-489 and regulation of cell-fate 


decision of satellite-cell progeny by Dek. a, Co-localization of Dek and 
myogenic markers in fibre-associated satellite cells. Fibre explants were fixed 
immediately after isolation (Day 0) or cultured for 3 days in suspension and 
stained for expression of Pax7 and Dek, or Myod and Dek, as indicated. Nuclei 
were stained with DAPI. b, CMV-miR-489 was co-transfected into 293T cells 
with wild-type (WT) or mutant Dek 3’ UTR constructs inserted after the stop 
codon of a luciferase gene. The Dek 3’ UTR carries three putative miR-489 
target sites (489, 489, and 489;) (putative pairing as shown in Supplementary 
Fig. 10). Schematics of wild-type and mutant constructs (m1, m2 and m3) are 
shown with the relative luciferase activities associated with each construct. 
*P < 0.001; *P < 0.05; NS: not significant. ¢, Satellite cells in fibre explants 
were transfected with Dek short interfering RNA (siRNA) (siDek) and cultured 
for 3 days, and the satellite-cell progeny were quantified by syndecan 4 staining 
(n = 3).***P < 0.001. d, FACS-purified QSCs were plated and transfected with 
miR-489 or siDek for 48h. EdU was added to the medium at the same time as 
transfection. Cells were stained for EdU incorporation and Myod expression. 
Bar graphs show the proportion of cells expressing each marker under each 
condition. e, Dek asymmetrically localizes to one daughter after cell division. 
Fibre-associated satellite cells were cultured for 48 h and stained for expression 
of Myod and Dek. Nuclei were stained with DAPI. f, The timeline for injury, 
EdU injections and collection of cells is shown (top). Cells were stained for EdU 
incorporation to reveal nonrandom template-strand segregation and for Myod 
expression to reveal divergent cell fates. Dek co-segregates almost exclusively 
with the newly synthesized template strands. Images show a representative 
example of a cell pair exhibiting divergent cell fates with asymmetric 
segregation of template strands. Nuclei were stained with DAPI. g, Quantitative 
analysis of concordant and discordant asymmetries of Dek and EdU in 
asymmetric satellite-cell divisions in studies such as those in f. **P < 0.001. All 
error bars indicate s.e.m. 


Dek protein was downregulated when QSCs or myoblasts were 
transfected with miR-489 (Supplementary Fig. 12), suggesting that 
Dek is a direct target of miR-489. To test this directly, wild-type and 
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mutant versions of the 3’ UTR of Dek were cloned downstream of a 
luciferase reporter, and these reporter constructs were co-transfected 
with an miR-489 expression construct into 293T cells. The wild-type 
Dek 3' UTR was effectively downregulated by miR-489 (Fig. 4b). 
Although TargetScan analysis revealed three potential target sites for 
miR-489, a single site (m2) was sufficient to account for the suppres- 
sion of reporter expression by miR-489 (Fig. 4b). 

We next examined the role of Dek in satellite-cell quiescence and 
activation using a loss-of-function approach. Dek knockdown reduced 
satellite-cell proliferation (Fig. 4c) and prevented satellite-cell activa- 
tion to the same degree as did miR-489 overexpression (Fig. 4d). The 
ability of Dek knockdown to phenocopy the effect of miR-489 over- 
expression suggests a central role of Dek in regulating satellite-cell exit 
from quiescence. To understand whether miR-489 overexpression sup- 
presses proliferation by regulating Dek expression, we overexpressed 
miR-489 or miR-489 mutant with a Dek complementary DNA con- 
struct that lacks its 3’ UTR in proliferating myoblasts. Overexpression 
of miR-489 alone reduced cell proliferation, whereas overexpression of 
Dek substantially increased cell proliferation independent of the 
expression of miR-489 or miR-489 mutant (Supplementary Fig. 13). 
Together, these experiments suggest that Dek is an important target of 
miR-489 that is involved in the regulation of satellite-cell quiescence 
and activation. 

Although Dek expression was highly induced after satellite-cell 
activation, consistent with its role in proliferative expansion of the 
transit-amplifying myogenic progenitors, it was absent in self-renewed 
satellite cells after muscle injury in vivo (Supplementary Fig. 11c). We 
therefore studied satellite-cell self-renewal in fibre explants ex vivo, in 
which the asymmetric expression of Myod by daughter cells heralds a 
divergent cell fate whereby the Myod-positive daughter progresses 
along the myogenic lineage and the Myod-negative daughter renews 
the satellite-cell population”. Intriguingly, in such pairs, we observed 
asymmetric Dek expression, in which Dek expression coincided with 
Myod expression in the same daughter cell (Fig. 4e). This co-localization 
suggests that the Dek-positive daughter is destined for proliferative 
amplification as a progenitor and that the Dek-negative daughter is 
destined for self-renewal. To test whether the process of self-renewal is 
associated with the absence of Dek, we examined cells undergoing 
asymmetric division by analysing nonrandom chromosome segrega- 
tion, a process that we and others have previously shown to distinguish 
the differentiating progenitor from the self-renewing stem cell*’”. 
Consistent with the Myod asymmetry, we found that Dek was absent 
in the daughter cell inheriting chromosomes bearing older template 
DNA strands, an inheritance pattern that is characteristic of the self- 
renewing cell, whereas Dek was expressed in the daughter cell that is 
destined for proliferative amplification and differentiation (Fig. 4f, g 
and Supplementary Fig. 14). 

The finding that Dek is a key target of miR-489 in maintaining 
quiescence provides insight into the molecular pathways that regulate 
the quiescent state. These data demonstrate that the molecular regu- 
lation of quiescence is dependent on the expression of specific miRNAs 
and is integrated in the signalling network that regulates divergent 
fates of stem-cell progeny during asymmetric cell division. 


METHODS SUMMARY 

Single-fibre explants. Extensor digitorum longus (EDL) muscles were excised and 
digested in Collagenase II (500 units per ml in Ham’s F10 medium) as previously 
described”. Fibres were then washed extensively and cultured in medium contain- 
ing Ham’s F10, 10% horse serum and 0.05% chick embryo extract. Every 24h, 50% 
of the medium was replaced with Ham’s F10 medium with 20% FBS. Extensor 
digitorum longus (EDL) fibres were cultured in suspension. Fixed fibres were 
stained and the number of satellite cells was quantified per fibre. 

Satellite-cell isolation and fluorescence-activated cell sorting. Hindlimb 
muscles were dissected and dissociated to yield a muscle suspension and digested 
with Collagenase II (500 units per ml; Invitrogen) in Ham’s F10 medium with 10% 
horse serum (Invitrogen) for 90 min. Digested fibre suspensions were washed and 
digested further with Collagenase II (100 units per ml) and Dispase (2 units per ml; 
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invitrogen) for 30 min. Digested fibre suspensions were triturated and washed 
further to yield a mononuclear-cell suspension for cell-surface staining for 
fluorescence-activated cell sorting (FACS). Mononuclear cells were stained with 
Vcam-biotin (clone 429; BD Bioscience), CD31-APC (clone MEC 13.3; BD 
Bioscience), CD45-APC (clone 30-F11; BD Bioscience) and Sca-1-Pacific-Blue 
(clone D7; Biolegend) at 1:75. Streptavidin-PE-cy7 was used to amplify the 
Vcam signal (BD Biosciences, 1:75). Cell sorting was performed using a BD 
FACSAria I or BD FACSAria III cell sorter equipped with 488-nm, 633-nm 
and 405-nm lasers. The machine was optimized for purity and viability, and sorted 
cells were subjected to FACS analysis directly after sorting to ensure purity. A small 
fraction of sorted cells was plated and stained for Pax7 and Myod to assess the 
purity of the sorted population purity. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 

Animals. C57BL/6, ROSAS??? and Der?" mice were obtained from 
Jackson Laboratory®’. Pax7**=® Cre mouse was provided by C. Keller. Tamoxifen 
injection for Cre recombinase activation was performed as described previously”. 
Unless indicated, all control animals used in this study carried the genotype Pax7 aa 
Der ?"9*P. ROSA26*!* and all conditional knockout strain animals carried the 
genotype Pax7*F™!*; Derl*P"?; ROSA26*!*. In Fig. la- d and Supplementary 
Fig. 3, control and conditional knockout strain animals are mice that carry the 
genotypes Pax7ER/*, Der*!*; ROSA26° YF P/eYEP and Pax7R'*, Der ho. 
ROSA26°°F PYF, respectively. To control for tamoxifen injection toxicity, we 
injected all mice with tamoxifen. Mice were housed and maintained in the 
Veterinary Medical Unit at Veterans Affairs Palo Alto Health Care Systems. 
Animal protocols were approved by the Administrative Panel on Laboratory 
Animal Care of Stanford University. 

Satellite-cell isolation and FACS. Hindlimb muscles were dissected and disso- 
ciated to yield a fragmented muscle suspension using gentleMACS dissociator 
(Miltenyl Biotec). The muscle suspension was then digested with Collagenase II 
(500 units per ml; Invitrogen) in Ham’s F10 medium containing 10% horse serum 
(Invitrogen) for 90 min. Fragmented myofibres were washed and digested further in 
Collagenase II (100 units per ml) and Dispase (2 units per ml; Invitrogen) for 
30 min. Digested-fibre suspensions were triturated and washed to yield a mono- 
nuclear cell suspension. Mononuclear cells were stained with Vcam-biotin (clone 
429; BD Bioscience), CD31-APC (clone MEC 13.3; BD Bioscience), CD45-APC 
(clone 30-F11; BD Bioscience) and Sca-1-Pacific-Blue (clone D7; Biolegend) at 1:75. 
Streptavidin-PE-cy7 was used to amplify the Vcam signal (BD Biosciences, 1:75). 
Cell sorting was performed using a BD FACSAria II or BD FACSAria III cell sorter 
equipped with 488-nm, 633-nm and 405-nm lasers. The machine was carefully 
optimized for purity and viability, and sorted cells were subjected to FACS analysis 
directly after sorting to ensure purity. A small fraction of sorted cells was plated and 
stained for Pax7 and Myod to assess the purity of the sorted population. 
Injections and electroporation. Mice were anaesthetized using isoflurane through 
a nose cone. Muscle injury was induced by injecting 1-2 pl of 1.2% BaCl, into 
approximately 25 sites in the lower hindlimb muscles. Electroporation of plasmid 
DNA into the tibialis anteriormuscle was performed as described previously~* using 
atwo-needle electrode array at a setting of 5 pulses of 50 ms duration at 150 Vcm"'. 
Antagomir molecules were injected into tail veins of 8-week-old mice at a dose of 
8mgkg | body weight. 

Antagomir synthesis. PAGE-purified RNAs were synthesized with modifications 
(Dharmacon). Sequences of single-stranded RNAs used in this study are as follows 
(*, phosphorothioate backbone at given position; Chl, cholesterol linked through a 
hydroxyprolinol linkage; m, 2'OMe-modified nucleotides): antagomir-489, 
5’mG*mC*mUmGmCmCmAmUmAmUmAmUmGmUmGmGmUmGmUm 
C*mA*mU*mU*3'-Chl; scramble, 5’mU*mU*mUmCmUmAmAmUmCmAm 
AmGmGmGmUmCmUmGmUmG*mG*mC*mU*3’-Chl. 

Histology and immunohistochemistry. For haematoxylin and eosin staining, 
tibialis anterior muscles were dissected and directly frozen in OCT (Tissue- 
Tek). For immunohistology, tibialis anterior muscles were fixed for 5h using 
0.5% electron-microscopy-grade paraformaldehyde and subsequently transferred 
to 20% sucrose overnight. Muscles were then frozen in OCT, cryosectioned with a 
thickness of 6 {ym and stained using an M.O.M kit (Vectorlabs) or a Zenon label- 
ling kit (Invitrogen) according to the manufacturers’ instructions. 

miRNA and siRNA transfections. Approximately 40 fibres were placed in each 
well of a 6-well plate containing 1 ml of Ham’s F10, 10% horse serum and 0.5% 
chicken embryo extract (US Biological). 100nM of miR-489 or anti-miR-489 
synthetic molecules (Ambion) were transfected into either freshly isolated single 
fibre explants or C2C12 cells using Lipofectamine 2000 (Invitrogen). Cells were 
collected for western blot 48 h after transfection. Control (cyclophilin B) and Dek 
siRNAs (Dharmacon) were dissolved and diluted as suggested by the manufac- 
turer. Lipofectamine 2000 (Invitrogen) was used for the transfection of Dek siRNA 
according to the manufacturer’s instructions. 

Single-fibre explants. EDL muscles were excised and digested in Collagenase II 
(500 units per ml in Ham’s F10 medium) as previously described’. Fibres were 
then washed extensively and cultured in medium containing Ham’s F10, 10% 
horse serum and 0.05% chick embryo extract. Every 24h, 50% of the medium 
was replaced with Ham’s F10 medium with 20% FBS. EDL fibres were cultured in 
suspension. Fixed fibres were stained and the number of satellite cells was quan- 
tified per fibre. 

RT-PCR and miRNA microarray. Total RNA was isolated using Trizol 
(Invitrogen). For individual RT-PCR, Taqman probes were used for detecting 
miR-17, miR-27b, miR-206, miR-489, sno420, Gapdh, Ctr, Pax7 and myogenin 
mRNA expression (Applied Biosystems). For miRNA microarrays (Applied 
Biosystems), reverse transcription and amplification was performed as described 
by the manufacturer. Diluted cDNAs were loaded onto the Taqman Array Rodent 
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MicroRNA A+B Cards Set v2.0 and qRT-PCR analysis was performed using an 
ABI 7900HT Fast Real-Time PCR System. miRNA gene expression was normalized 
to U6 small nuclear RNA. Relative quantitation of miRNA gene expression was 
performed using the delta delta CT method”’. Data have been deposited at NCBI 
Gene Expression Omnibus under the accession number GSE26780. 

DNA cloning and luciferase assay. A 300-base-pair genomic fragment flanking 
pre-miR-489 was cloned from mouse genomic DNA with the 5’ primer 
CCCCATGAGGGCAGAAACCAT and the 3’ primer TTATGATGCAACAAAT 
ATAT. The fragment was then sub-cloned into pGEM-T-Easy (Promega) and 
inserted into pcDNA3.1 plasmid to generate CMV-miR-489 plasmid. To generate 
the miR-489 mutant plasmid, four point mutations were introduced into the miR- 
489 plasmid using the following primers: 489_m1- 5’ primer CTGCAGT 
GGCAGCTTGGTTTTCATATCTGTAATGATACTTTCTAAAGTCTTCCAG, 
3’ primer CTGGAAGACTTTAGAAAGTATCATTACAGATATGAAAACCAA 
GCTGCCACTGCAG, 489_m2-5' primer CTTTCTAAAGTCTTCCAGAATA 
ACACTACAGATATGGAAGCTAAACTGTTACATGGAACAAC, 3’ primer 
GTTGTTCCATGTAACAGTTTAGCTTCCATATCTGTAGTGTTATTCTGG 
AAGACTTTAGAAAG, 

These inserts (CMV-miR-489 and CMV-miR489-mutant) were then sub- 
cloned into pMR-Zsgreen1 to generate plasmids containing a ZsGreen reporter. 

The Dek 3’ UTR was cloned by amplifying the region of the Dek 3’ UTR that 
contains miR-489-binding sites from mouse genomic DNA using the 5’ primer 
AAGTGACAGATGTTATTTTT and the 3’ primer AACATTGATTTATTCTT 
TAT. The Dek UTR luciferase construct was generated by inserting this fragment 
into pMIR-report plasmid (Ambion). Dek mutants (ml, m2 and m3) were 
generated using the QuikChange II site directed mutagenesis kit (Stratagene). 
For each putative miR-489 site, two point mutants were introduced to the seed 
sequence using the following primers: m1-5' primer GITTCTGCTTTGCCC 
TCAAAGTATAATCAATGTGGTTGTG, 3’ primer CACAACCACATTGATT 
ATACTTTGAGGGCAAAGCAGAAC, m2-5’ primer GTCATCAATGTGGTT 
GTGTTAACTCTAAGTATAATAGAAATTTTATAATGAGG, 3’ primer CCT 
CATTATAAAATTTCTATTATACTTAGAGTTAACACAACCACATTGATG 
AC, m3-5’ primer GTTGGCCTTTAAGCAATTTATAATAAATCTTCACAAT 
AAAGAATAAATC, 3’ primer GATTTATTCTTTATTGTGAAGATTTATTAT 
AAATTGCTTAAAGGCCAAC, 

Luciferase assays were performed by seeding 5 X 10° cells per well in 6-well 
plates. Cells were then transfected with 0.25 jg of 3’ Dek UTR constructs, 0.75 ig 
of the miR-489 expression construct and 50 ng of the pRL-TK Renilla luciferase 
control vector. Cells were transfected using FuGENE 6 according to the manu- 
facturer’s instructions. Forty-eight hours after transfection, cells were lysed and 
luciferase activities were measured using the Dual Luciferase Assay System 
(Promega) with a 20/20n luminometer (Turner Biosystems). 

The mouse pCMV-Sport6 Dek plasmid was purchased from Open Biosystems. 
The pCMV-Sport6 Dek deltaUTR construct was made by excising the Dek 3’ UTR 
using restriction enzymes BglII and Not I and re-ligated to generate a Dek expres- 
sion plasmid without its 3’ UTR. 

Template-strand analysis. Analysis of nonrandom template-strand segregation 
was performed as described with several modifications”. Briefly, muscles of 
8-week-old mice were injured as described and 200 1g of EdU (Invitrogen) were 
injected intraperitoneally 48 h and 52 h after injury. Satellite cells were then sorted 
using the scheme as described and plated on poly-1-lysine-treated chamber slides 
(BD Biosciences) coated with extracellular matrix gel (Sigma) diluted at 1:100 in 
DMEM medium. To facilitate the analysis of sister-cell pairs, sorted cells were 
plated at very low density (~10 cells per mm’). After allowing cells to adhere for 
1h, cultures were treated with cytochalasin D (5 1M; Sigma) to prevent cytokinesis. 
Cells were fixed and stained using the Click-iT EdU Imaging Kit (Invitrogen) and 
antibodies recognizing Dek or Myod. Sister-cell pairs were identified as two nuclei 
less than one-cell-diameter apart with contiguous cytoplasm that was evident using 
brightfield microscopy. Between 200 and 250 cell pairs were scored per experiment 
and all experiments were performed in triplicate. 

Western blot analysis. Muscle tissues and cells were extracted in lysis buffer 
(50 mM Tris-HCl, pH 7.5, 0.5% SDS, 20 jig ml aprotinin, 20 pg ml! leupeptin, 
10 pg ml“! phenylmethylsulfonyl fluoride, 1 mM sodium orthovanadate, 10 mM 
sodium pyrophosphate, 10 mM sodium fluoride and 1 mM dithiothreitol). Protein 
extracts were subjected to electrophoresis on 4-15% polyacrylamide gradient gels 
and then transferred to nitrocellulose membranes. The membranes were 
incubated in blocking buffer (PBS and 5% milk) before overnight incubation with 
primary antibodies. After incubation with corresponding fluorescent secondary 
antibodies (Invitrogen), the membranes were analysed using the Odyssey imaging 
system (LI-COR). Glyceraldehyde-3-phosphate dehydrogenase or actin was used 
as a loading control. 

Statistical analysis. All statistical analyses were performed using GraphPad Prism 
5 (GraphPad Software). Unless otherwise noted, all error bars represent s.e.m. 
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Immunofluorescence and antibodies. Immunofluorescence was performed 
using a Zeiss Observer Z1 fluorescent microscope (Zeiss) equipped with a 
Hamamatsu Orca-ER camera or a Zeiss confocal system LSM710 (Zeiss). 
Data acquisition and fibre-diameter measurements were performed using 
Improvision Volocity software (Perkin Elmer) or Zeiss LSM ZEN software 
(Zeiss). 

Antibodies. The antibodies used in this study were Pax7 (DSHB, 1:100), Ki67 
(Abcam, 1:100 and BD Bioscience, 1:50), laminin (Sigma, 1:1,000), cleaved caspase3 


(Cell signaling, 1:100), Myod (Dako, 1:1,000), green fluorescent protein (GFP) 
(Invitrogen, 1:250 and Abcam, 1:250), Dek (Proteintech Group, 1:2,000) and syn- 
decan 4 (gift from Bradley Olwin, 1:1,000). 


24. Bertoni, C. et al. Enhancement of plasmid-mediated gene therapy for muscular 
dystrophy by directed plasmid integration. Proc. Natl Acad. Sci. USA 103, 419-424 
(2006). 

25. Pfaffl, M.W.A new mathematical model for relative quantification in real-time RT- 
PCR. Nucleic Acids Res. 29, e45 (2001). 
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Medulloblastoma, the most common malignant paediatric brain 
tumour, arises in the cerebellum and disseminates through the cere- 
brospinal fluid in the leptomeningeal space to coat the brain and 
spinal cord’. Dissemination, a marker of poor prognosis, is found 
in up to 40% of children at diagnosis and in most children at the time 
of recurrence. Affected children therefore are treated with radiation 
to the entire developing brain and spinal cord, followed by high-dose 
chemotherapy, with the ensuing deleterious effects on the developing 
nervous system’. The mechanisms of dissemination through 
the cerebrospinal fluid are poorly studied, and medulloblastoma 
metastases have been assumed to be biologically similar to the 
primary tumour**. Here we show that in both mouse and human 
medulloblastoma, the metastases from an individual are extremely 
similar to each other but are divergent from the matched primary 
tumour. Clonal genetic events in the metastases can be demon- 
strated in a restricted subclone of the primary tumour, suggesting 
that only rare cells within the primary tumour have the ability to 
metastasize. Failure to account for the bicompartmental nature of 
metastatic medulloblastoma could be a major barrier to the 
development of effective targeted therapies. 

Thirty percent of patched-1-heterozygous (Ptch*'~) mice develop 
non-disseminated medulloblastoma by 8 months of age’. Recently, the 
Sleeping Beauty (SB) transposon system was shown to be an effective 
tool for functional genomics studies of solid tumour initiation and 
progression®’. We expressed the SB11 transposase in cerebellar 
progenitor cells in transgenic mice under the Math1 (also known as 
Atoh1) enhancer/promoter, but we did not observe any tumours when 
these mice were bred with mice transgenic for a concatemer of the T2/ 
Onc transposon* (Fig. la—j and Supplementary Figs 1 and 2). However, 
on a Ptch*'~ background, these Math1-SB11/T2Onc mice showed 
increased penetrance of medulloblastoma (~97%; 271 of 279 mice) 
compared with controls (~39%; 54 of 139 mice), as well as decreased 
latency (2.5 months compared with 8 months) (Fig. 1 and Supplemen- 
tary Fig. 2). Although Ptch*/~ medulloblastomas are usually localized, 
the addition of SB transposition results in metastatic dissemination 
through the cerebrospinal fluid pathways, identical to the pattern that 


is seen in human children (Fisher’s exact test, P= 1.8 X 10”, odds 
ratio = 5.2; Supplementary Table 1) (Fig. 1c, dand Supplementary Fig. 2). 
As neither transposon nor transposase alone had an effect on tumour 
incidence, latency or dissemination, we conclude that SB-induced 
insertional mutagenesis drives medulloblastoma progression on the 
Ptch*’~ background (Fig. 1i and Supplementary Fig. 2). 

Humans with germline mutations in the tumour-suppressor gene 
TP53 have Li-Fraumeni syndrome and have an increased risk of 
developing medulloblastoma. Although no medulloblastomas were 
found in mice with mutant Tp53 (also known as Trp53) (denoted 
Tp53™" mice, which includes Tp53*/~ and Tp53 ‘~), 40% of 
Tp53™/Math1-SB11/T2Onc mice developed disseminated medullo- 
blastoma’ (Fig. le-h, j and Supplementary Fig. 2). Human medullo- 
blastomas with TP53 mutations frequently have large cell/anaplastic 
histology. Tp53™"/Math1-SB11/T2Onc medulloblastomas have large 
cells, nuclear atypia and nuclear moulding that is typical of large cell/ 
anaplastic histology (Fig. 1f). We conclude that SB transposition can 
drive the initiation and progression of metastatic medulloblastoma on 
a Tp53™ background. 

We used linker-mediated PCR and 454 sequencing to identify the 
site of T2/Onc insertions in Ptch*'~/Math1-SB11/T2Onc and 
Tp53™™/Math1-SB11/T2Onc primary medulloblastomas and their 
matched metastases. Genes that contained insertions statistically more 
frequently than the background rate were identified as gene-centric 
common insertion sites (gCISs)'®. We identified 359 gCISs in 139 
primary tumours on the Ptch*’~ background and 26 gCISs in 36 
primary medulloblastomas on the Tp53™ background (Supplemen- 
tary Tables 2-7 and Supplementary Figs 3-5). A large number of gCISs 
were candidate medulloblastoma oncogenes or tumour-suppressor 
genes'’ (Supplementary Table 8). Insertions in candidate tumour- 
suppressor genes, including Ehmt1, Crebbp and Mxil, are predicted 
to cause a loss of function (Fig. 1k-m), whereas insertions in putative 
medulloblastoma oncogenes are largely gain of function, as exemplified 
by Myst3 (Fig. 1n). 

Many gClISs mapped to regions of amplification, focal hemizygous 
deletion and homozygous deletion (Supplementary Table 8) that we 
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recently reported in the genome of a large cohort of human medullo- 
blastomas’’. There is a high level of overlap between gCISs and known 
cancer genes (in the Catalogue of Somatic Mutations in Cancer 
(COSMIC) database) (Supplementary Tables 9 and 10), suggesting 
that many gCISs are bona fide driver genes in medulloblastoma 
(Fisher’s exact test, P= 0.0012)’. Similarly, many of the mouse 
gClSs and the genes amplified in human medulloblastomas are over- 
expressed in human SHH-driven medulloblastomas (Supplementary 
Fig. 6). Conversely, mouse gCISs hemizygously deleted in human 
medulloblastomas were frequently expressed at a lower level in human 
medulloblastomas (Supplementary Fig. 6). The expression of six out of 
seven gCISs that had been studied by immunohistochemistry on a 
human medulloblastoma tissue microarray was associated with sig- 
nificantly worse overall and progression-free survival in human 
medulloblastoma’* (Supplementary Table 11 and Supplementary 
Figs 7 and 8). We conclude that our SB-driven leptomeningeal- 
disseminated medulloblastoma model resembles the human disease 
anatomically, pathologically and genetically and thus is an accurate 
model of the human disease that can be used to identify candidate driver 
events and understand the pathogenesis of human medulloblastoma. 

We compared the gCISs identified from Ptch*’~/Math1-SB11/ 
T2Onc and Tp53™"'/Math1-SB11/T2Onc primary medulloblastomas 
and matched metastases (Supplementary Table 2). Strikingly, the over- 
lap between primary tumour gCISs (pri-gCISs) from Ptch *’~ /Math1- 
SB11/T2Onc tumours and those from metastases (met-gCISs) from 
the same animals was only 9.3% of all gCISs (Fig. 2a). Similarly, the 
overlap between pri-gCISs from Tp53™/Math1-SB11/T2Onc mice 
and the matching met-gCISs was only 8.9% (Fig. 2b). The leptomeningeal 
metastases and the matched primary tumour have identical, highly 
clonal insertion sites on both genetic backgrounds (Fig. 2c). The 
probability of two (or three) unrelated tumours having SB insertions 
in exactly the same TA dinucleotide is extremely low. We conclude that 
the leptomeningeal metastases and the matched primary tumour arise 
from a common transformed progenitor cell and have subsequently 
undergone genetic divergence. 

Sequencing also identified insertions that are highly clonal in the 
metastases but are not observed in the matched primary tumour (data 
not shown). End-point PCR for these insertions in the matched primary 
and metastatic tumours shows that the insertion is highly clonal in the 
metastasis (or metastases) and is present in a very small subclone of the 
primary tumour (Fig. 2d and Supplementary Fig. 9). These data are 
consistent with a model in which metastatic disease arises from a minor 
restricted subclone of the primary tumour. Dissemination could occur 
repeatedly from the same subclone of the primary tumour, which seeds 
the rest of the central nervous system, or it could occur once, followed 
by reseeding of the rest of the leptomeningeal space by the initial 
metastasis. Insertions that are restricted to a minor subclone of the 


Figure 1 | Transposon mutagenesis models of disseminated human 
medulloblastoma. a-d, The histology of transposon-driven medulloblastoma 
on the Ptch*/~ background resembles human medulloblastoma, with 
leptomeningeal metastases on the surface of the brain (c) and spinal cord 

(d). Images show haematoxylin and eosin staining (a, entire brain; b, upper 
spinal cord). e-h, The histology of transposon-driven medulloblastoma on the 
Tp53™ background shows histological features of large cell/anaplastic 
medulloblastoma, including nuclear pleomorphism and nuclear wrapping 

(f). Dissemination to the leptomeningeal spaces of the brain (g) and spinal cord 
(h) also occurs on this background. i, Ptch*/~ mice with SB transposition 
develop more frequent medulloblastomas with a shorter latency than Ptch*!~ 
mice without transposition. P values are from t-tests of survival comparing 
individual genotypes to Ptch*/~ mice; n, number of mice per genotype. mo., 
months. j, Medulloblastoma (MB) was not observed in Tp53™"* mice without 
transposition but was observed in 42% of Tp53™™' mice with transposition. P 
values are from f-tests comparing survival between Tp53™™' mice and Tp53™'/ 
SB11/T2Onc mice with MB; n, number of mice. kn, Insertion maps of notable 
gCISs. Insertions in the direction of transcription are denoted by green arrows, 
and those against the direction of transcription are denoted by red arrows. 
Transcription start sites are denoted by black arrows. 
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Figure 2 | Transposon-driven metastatic medulloblastoma genetically 
differs from the primary tumour. a, b, Venn diagrams depicting the degree of 
overlap and discordance in the gCISs in primary tumours and metastases, on 
the Ptch*/~ and Tp53™™ backgrounds. c-f, Insertion-site end-point PCR was 
used to demonstrate the relative clonality of insertions between samples. Data 
for medulloblastoma in five mice are shown (mouse 143, left; and four mice, 
right). Three levels of input DNA were used for each sample (1X, 5 and 25x, 
with the increase depicted by a wedge). Shown are clonal events found in both 
the primary tumour and matching metastases (met) (c), insertions that are 
highly clonal in the metastases but very subclonal in the matching primary 
tumour (d), insertions that are highly clonal in the metastases but undetectable 
in the matching primary tumour (e), and insertions that are highly clonal in the 
primary tumour but undetectable in the matching metastases (f). NC, negative 
control; genomic DNA from a Math1-SB11/T2Onc double-transgenic mouse 
cerebellum. 


primary tumour but that are clonal in the metastases could correspond 
to the previously described ‘metastasis virulence’ genes”, which offer a 
genetic advantage during dissemination but not to the primary 
tumour. Another explanation for our data could be that the primary 
tumour was reseeded by a metastatic clone that had acquired addi- 
tional genetic events in the periphery. This hypothesis is mitigated by 
the presence of highly clonal insertions in the metastasis that are com- 
pletely absent from the primary tumour in the same animal’® (Fig. 2e). 
As reseeding should be accompanied by contamination of the primary 
tumour with events found in the metastases, the absence of these events 
in the matched primary tumour makes reseeding much less likely 
(Fig. 2e). We propose that events found in only one metastasis repres- 
ent progression events that are acquired post metastasis and that could 
lead to localized progression of metastatic disease, as is sometimes 
observed in human children. 

We observed highly clonal insertions in the primary tumour, 
including in known medulloblastoma oncogenes such as Notch2 and 
Tert, that were not found in the matching metastases (Fig. 2f). This 
pattern could be explained through remobilization of the SB transposon 
in the metastatic tumour; however, no signs of the DNA footprint 
remaining after SB remobilization at these loci were observed'® 
(Supplementary Fig. 10). We suggest that these events, which may con- 
stitute driver events in the primary tumour, have arisen in the primary 
tumour after the metastases have disseminated (post-dispersion events). 
Although these known oncogenes are attractive targets for therapy, 
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their utility as targets may be limited if the targets are not also found 
in the leptomeningeal compartment of the disease. Our data from two 
separate mouse lines support a model in which medulloblastoma 
disseminates early from a restricted subclone of the primary tumour 
and in which the primary tumour and the matched metastases then 
undergo differential clonal selection and evolution. Failure to account 
for the differences between the primary and leptomeningeal compart- 
ments could lead to the failure of targeted therapies. Failure to study the 
leptomeningeal disease (Fig. 2d, e) could result in systematically over- 
looking crucial targets for therapy in this compartment. 

Examining the met-gCISs using Gene Set Enrichment Analysis 
(GSEA) demonstrated differences between the primary and metastatic 
disease, including enrichment for genes involved in the cytoskeleton in 
metastases (Supplementary Table 12). Targets that are present in both 
compartments and are maintenance genes, as exemplified by Pdgfra, 
will be optimal targets for treating both the primary tumour and the 
metastases (Fig. 2c and Supplementary Tables 7 and 9). 

Pten, Akt2, Igf2 and Pik3rl are all met-gCISs, implicating the 
phosphatidylinositol-3-OH kinase (PI(3)K) pathway in medulloblas- 
toma progression. We injected the cerebellum of Nestin-TVA mice’” 
with either an Shh-overexpressing retroviral vector (denoted Shh 
virus) or an Shh- and Akt-overexpressing retroviral vector (denoted 
Shh + Akt virus). Cerebellar injection of Shh virus alone resulted in 
medulloblastomas in 6 of 41 animals, compared with 20 of 42 animals 
injected with Shh + Akt virus (P= 0.0018). Although metastases 
were not observed with Shh virus alone (0 of 41), medulloblastoma 
metastases were observed in 9 of 42 animals injected with Shh + Akt 
virus (P = 0.0024) (Supplementary Fig. 11). In vivo modelling vali- 
dates PI(3)K signalling as a putative contributor to leptomeningeal 
dissemination of medulloblastoma. 

Previous publications and clinical approaches to human medullo- 
blastoma have largely assumed that the primary tumour and its 
matched metastases are highly similar**. To test this assertion, we 
formally reviewed all cases of medulloblastoma from the past decade 
at The Hospital for Sick Children, in Ontario, Canada, and we iden- 
tified 19 patients who had bulk residual primary tumour after surgery 
and metastases visible by magnetic resonance imaging, both of which 
could be followed for response to treatment (Supplementary Fig. 12 
and Supplementary Table 13). Although it is possible that the meta- 
stases received less radiotherapy than the primary tumour in a subset 
of patients, in 58% of all cases (11 of 19) we observed a disparate 
response to therapy between the primary tumour and the matched 
metastases (binomial test, P< 2.2 X 10 '°). Identification of definitive 
differences in the clinical response to standard therapy between the 
primary and the metastatic compartment awaits the completion of 
large, well-controlled, prospective clinical trials. 

We examined seven matched primary and metastatic medulloblas- 
tomas for copy number aberrations (Fig. 3, Supplementary Figs 13 and 
14, and Supplementary Tables 14 and 15). In each case, the primary 
tumour and the matched metastases shared complicated genetic events 
that provide strong support for their descent from a common trans- 
formed progenitor cell. Similar to our mouse data, in each case we 
observed clonal genetic events in the metastatic tumour(s) that were 
not present in the matched primary tumour (Fig. 3 and Supplementary 
Fig. 14). We also observed genetic events in the primary tumour that 
were absent from the matched metastases, consistent with a post- 
dispersion event (Fig. 3 and Supplementary Fig. 14). One patient with 
multiple leptomeningeal metastases had a deletion of chromosome 1p 
in only one of three examined metastases (Fig. 3a). This pattern of 
genetic events being present in only a subset of metastases could be a 
mechanism for the emergence of therapy-resistant metastatic clones. 

We performed interphase fluorescence in situ hybridization (FISH) 
for the known medulloblastoma oncogenes MYCN and MYC on a 
collection of 17 paraffin-embedded primary and metastatic pairs of 
human medulloblastomas'**°. MYCN was amplified in three primary 
medulloblastomas but not in the matching metastases (Fig. 3b and 
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Figure 3 | Human medulloblastoma metastases are biologically distinct 
from their matched primary tumour. a, Copy number data from a primary 
medulloblastoma (MB-C-Pri) and three patient-matched metastases (MB-C- 
Met1, MB-C-Met2 and MB-C-Met3), with chromosomal regions in red 
representing genetic gain (amplification) and in blue denoting genetic loss 
(deletion). Examples of shared clonal events (red boxes) and events limited to 
one but not all metastases (black box) are shown. Chr, chromosome. 

b, Interphase FISH shows amplification of MYCN in a primary tumour but not 
the matched metastasis. Nuclei appear blue owing to 4’ ,6-diamidino-2- 
phenylindole (DAPI) staining. c, Interphase FISH for MYC demonstrates 
amplification in both the primary tumour and its matched metastases. d, Venn 
diagrams depicting the degree of overlap and discordance in promoter CpG 
methylation events and CNAs in primary medulloblastomas and their matched 
metastases, with MB-C, MB-D and MB-F and MB-H denoting different patients. 
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Supplementary Fig. 15). Conversely, MYC was amplified in two 
primary tumours and their matching metastases (Fig. 3c). These data 
are consistent with MYCN amplification being a post-dispersion event, 
similar to examples in SB-driven mouse medulloblastoma, and 
strongly indicate that anti- MYCN therapeutics may lack efficacy in 
the metastatic compartment of human medulloblastoma. The possibility 
that MYCN amplicons in the metastases have been ‘lost’ over time 
cannot be excluded. 

We subsequently analysed promoter CpG methylation in these 
matched pairs and found much discordance between the primary 
tumour and matched metastases (Fig. 3d, Supplementary Figs 13 
and 16, and Supplementary Tables 16 and 17). Finally, we performed 
whole-exome sequencing on a limited set of matched primary and 
metastatic medulloblastomas and found many single nucleotide var- 
iants (SNVs) that were restricted to a single compartment (Sup- 
plementary Fig. 13 and Supplementary Table 18). The discordance 
of CNAs, promoter CpG methylation events and SNVs between the 
primary tumour and its matched metastases supports a bicompart- 
mental model for metastatic medulloblastoma. The mutational load in 
the human tumours (the combination of CNAs, CpG methylation and 
SNVs) compares favourably with the mutational load in our transposon- 
driven mouse models (in which the median number of gCISs is 25 per 
tumour; Supplementary Table 19). Validation of the individual CNAs 
that were restricted to the metastases showed that these CNAs can be 
detected in a very minor subclone of the primary tumour, in keeping 
with the relationship identified in the mouse model (Supplementary 
Fig. 17 and Supplementary Tables 20 and 21). Pathway analysis using 
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the Database for Annotation, Visualization and Integrated Discovery 
(DAVID) to compare mouse gCISs with the genes that were affected in 
the human metastases identified only one statistically significant 
shared signalling pathway: insulin signalling (P = 0.027) (Supplemen- 
tary Table 22). The known role of insulin receptor signalling in primary 
medulloblastoma”', together with the data presented here on the role of 
AKT in metastatic medulloblastoma, suggests that insulin signalling 
should be prioritized as a therapeutic target to be tested in clinical trials. 

We performed unsupervised hierarchical clustering on the CpG 
methylation data, and we found that normal cerebellar controls cluster 
away from the medulloblastomas, whereas metastases cluster with 
their matching primary tumour (Fig. 4a). However, metastases cluster 
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Figure 4 | Human medulloblastoma metastases are genetically distinct from 
their matched primary tumour. a, Profiling the methylation status of 27,578 
CpG dinucleotide sites in the human genome in a collection of human matched 
primary and metastatic medulloblastomas; the top 2,000 genes are shown. 
Unsupervised hierarchical clustering by CpG methylation pattern 
demonstrates that patient-matched metastases are more similar to each other 
than to the matched primary tumour. b, Unsupervised clustering of regions of 
copy number gain and loss demonstrates that patient-matched metastases are 
more similar to each other than to the matched primary tumour. 

c, Unsupervised hierarchical clustering of SNV data from whole-exome 
sequencing demonstrates that patient-matched metastases are more similar to 
each other than to the matched primary tumour. SNVs that are found only in 
the primary compartment or only in both examined tumours in the metastatic 
compartment are evident. Coph, cophenetic correlation coefficient. 
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closer to each other than they do to the matched primary tumour 
(z-test, P = 0.0014) (Supplementary Fig. 18). Unsupervised hierarchical 
clustering of CNA and exome SNV data uncovered the same relation- 
ships (Fig. 4b, c). Evident within the exome data are many events that 
are shared only by patient-matched metastases (that is, metastases 
froma single patient), as well as events that are restricted to the primary 
tumour, both of which are similar to the genetic patterns observed in 
mice. These three data sets support a model in which patient-matched 
human medulloblastoma metastases are epigenetically and genetically 
very similar to each other but have substantially diverged from the 
primary tumour, resulting in two different disease compartments: 
the primary and metastatic compartments. 

Our data from two mouse models, with support from initial data 
from human medulloblastoma, suggest that leptomeningeal metastases 
of medulloblastoma from a single human or mouse are genetically 
similar to each other but are highly divergent from the matched primary 
tumour, consistent with a bicompartmental model of disease. Our 
results are consistent with a model in which metastases arise from a 
restricted subclone of the primary tumour through a process of clonal 
selection in both humans and mice. That metastases might arise from a 
pre-existing minor subclone of the primary tumour through clonal 
selection was suggested more than three decades ago, but it remains a 
controversial hypothesis that might not be true of all cancers”. 
Failure to account for the divergent molecular pathology of the meta- 
static compartment may result in selection of therapeutic targets pre- 
sent in the primary tumour, which is more amenable to surgical control, 
but not the metastases, which are the more frequent cause of death. 


METHODS SUMMARY 

Generation of Math1-SB11 construct. SB11 cDNA was excised from the vector 
pCMV-SB11 and ligated into the vector J2Q-Math1 (refs 8, 26). 
Linker-mediated PCR and 454 deep sequencing. Bar-coded, linker-mediated 
PCR was performed as previously described®. Sample preparation for the 454 
sequencing and the subsequent procedures was performed as previously 
described”. 

Determination of gCISs. A chi-squared analysis was performed to determine 
whether the number of observed integration events within each transcription unit 
in the SB-driven medulloblastomas was significantly greater than expected given 
the following: the number of TA dinucleotide sites within the gene relative to the 
number of TA sites in the genome, the number of integration sites within each 
tumour, and the total number of tumours in each cohort. This gCIS analysis 
produced a P value for each of the ~19,000 mouse RefSeq genes, and Bonferroni 
correction was therefore used to adjust for multiple hypothesis testing. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 

Linker-mediated PCR and 454 deep sequencing. Genomic DNA was isolated 
and purified from mouse tissues with a DNeasy Blood & Tissue Kit (QIAGEN). 
The subsequent bar-coded, linker-mediated PCR was performed as previously 
described®. Sample preparation for the 454 sequencing and the subsequent pro- 
cedures was performed as previously described’’. 

PCR for SB-tagged fragments. The primers for amplifying SB-transposon inser- 
tion sites were designed based on the chromosomal location of each independent 
insertion site and its orientation to transcription. The primers at the inverted 
repeats/direct repeats (left) (IRDRL) and inverted repeats/direct repeats (right) 
(IRDRR) of the transposon were 5’-CTGGGAATGTGATGAAAGAAATAAAA-3' 
and 5’-TTGTGTCATGCACAAAGTAGATGT-3’, respectively. The input repre- 
sents genomic DNA with SB transposition, which was illustrated by SB excision 
PCR that detected the transposon post transposition’. Three points of input 
(1X, 5X and 25X) were used. The following primers were used: Pde4d-143L, 
5'-CACATAAAAACTGGACACCTAG-3’; Pdgfra-131R, 5'-CTATCATGACCA 
CACGGAAGAGAGTGAAC-3’; Dnajb11-143L, 5'-CATGAGCTATGGCACA 
GATAC-3’; Fubp1-143R, 5'-CACTAGTGCCCATGGATTAGG-3’; Ptges-143R, 
5'-CAGAACTGATAGAGGCCAAAG-3'; Irx2-25L, 5’-CAACACTTTCAGAC 
ACACATATATC-3’; Igf2-112R, 5'-GTGACCAGTGTGTATTCGTGGAATTT 
TTTGGG-3'; and Notch2-114R, 5'-CAGTGTCCAGGCAGTCATTTCAAAGA 
GTG-3’. Details about the primer design for specific insertion sites and the PCR 
protocol are available on request. 

Review of clinical cases. We systematically reviewed all cases of medulloblastoma 
seen at The Hospital for Sick Children (Toronto, Ontario) over the past ten years. 
Cases that have both metastases and post-operative residual bulky disease at the 
primary site were identified on the basis of post-operative imaging obtained within 
72h of surgery. All radiology results were reviewed by a senior neuro-oncologist 
(E.B.). Objective responses of both the primary tumour and the metastatic disease 
were measured using the standard International Society of Paediatric Oncology 
(SIOP) criteria for clinical trials of paediatric brain tumours”*. 

End-point PCR on human samples. For PCRs to confirm the deletion of the 
CDKN2A locus on chromosome 9, a genome-walking approach (GenomeWalker 
Universal Kit, Clontech, Catalogue number 638904) was taken to locate the spe- 
cific deletion region based on single nucleotide polymorphism (SNP) coordinates. 
The following primers flanking the deletion region were used: forward, 
5'-GCAATTAACCAAGACCACCCAATGGCAAG-3’; and reverse, 5'-GTAGC 
TATTGGGGAGGTTGAGAAGGAG-3’. Three points of input shown as ACTB 
(1X, 5X and 25x) were used. The PCR products were inserted into the pCR2.1TA 
cloning vector (Invitrogen), sequenced and searched against the human genome in 
the blast database to confirm the deletion. For REXO1L1 deletion on chromosome 
8, specific primers flanking the deletion region were designed based on SNP 
microarray results. The PCR products were TA-cloned and sequenced as 
described above. The following primers were used: forward, 5'-GGCTGACTC 
CCTTCTGATATAG-3’; and reverse, 5’-CAATCACTTACAGTTACTAGGC 
AC-3'. Details about the primer design and PCR protocols are available on 
request. 

Chromosomal mapping of gCISs. Chromosomal maps of gCIS-associated genes 
were obtained from the UCSC Mouse Genome Browser (assembly in July 2007). 
Each insertion site of a specific CIS was mapped to the gene with the same 
orientation as the direction of transcription (arrow in green) or the inverse ori- 
entation to the direction of transcription (arrow in red). 

Human medulloblastoma tumour specimens. All tumour specimens were 
obtained in accordance with the Research Ethics Board at The Hospital for Sick 
Children. Surgically resected, fresh frozen samples were obtained from the 
Cooperative Human Tissue Network and the Brain Tumor Tissue Bank. 

SB remobilization. Potential SB insertion sites at Fubp1, Mnatl or Igf2 in primary 
tumours from mouse numbers 143, 14 or 11 or sites at Ptges, Aofl and Notch2 in 
the matched spine metastases were tested for remobilization. The primers were 
designed to amplify each insertion site to produce approximately 300 base pairs 
(bp) with the insertion site in the middle. PCR products were either sequenced 
directly or after being TA-cloned. The resultant sequences were examined for 
‘scars’ from potential remobilization. As positive controls for the scars, primers 
were used to amplify the T2/Onc transposon in each sample®. The products were 
sequenced and examined for the scars as described above. The following primers 
were used: Aofl forward (Fw), 5'-TACTCCAGACAGTCAGTCAGTG-3’; Aofl 
reverse (Rv), 5’-TAGTTCTGCCTCATGCCACAAG-3’; Ptges Fw, 5'-ACAGAG 
AAGGCTTCAGAGCTC-3’; Ptges Rv, 5'-GGTGCTCTCTGCTGTCCAATC-3’; 
Notch2 Fw, 5'-CAAGCTTTCAAGTATAAACCACGC-3’; Notch2 Rv, 5'-GAAT 
GCATCATCCAGTGTCCAG-3’; Fubp1 Fw, 5'-AGGAACGGGCTGGTGTTAA 
AATG-3'; Fubp1 Rv, 5'-TCTAATACCATTTCCTTGGCTTGC-3’; Mnat1 Fw, 
5'-CTAACACATCAGAGTTGGACAAG-3’; Mnat1 Rv, 5'-CATGAAGACCTG 
AGAGTGCAG-3’; Igf2 Fw, 5'’-GTGATTGGTGAATGTACTCTTTCC-3’; and 


Igf2 Rv, 5'-GTGGAACACTAGATTCTGTAGTC-3’. Details about the primer 
design and PCR protocols are available on request. 

Hierarchical clustering. Agglomerative hierarchical clustering analyses were per- 
formed in the R statistical programming environment (version 2.13). The average 
linkage method was used in all cases. Because different data types were used in the 
various analyses, the metric used for clustering differed between the analyses. The 
Manhattan distance metric was used for the copy number data because the data 
were encoded as {—1, 0, 1}. The magnitudes of the CNAs were not considered, 
owing to a multitude of confounding factors, including tumour heterogeneity and 
ploidy. The Kendall rank correlation was used for the SNV frequency data because 
the data distributions were not normal. The Pearson correlation was used for the 
methylation data, which were normally distributed. 

Identification of CpG hypermethylation events. Human genomic DNA was 
isolated from matching primary and metastatic medulloblastomas obtained from 
Johns Hopkins University, the Virginia Commonwealth University and New York 
University. An EZ DNA Methylation Kit (Zymogen Research) was used to 
bisulphite convert 500ng each sample. The recovered DNA was profiled on 
HumanMethylation27 BeadChips (Illumina) at The Centre for Applied 
Genomics (TCAG). Subsequently 27,578 CpG dinucleotides spanning 14,495 genes 
were analysed. The probe signal intensity was corrected by using BeadStudio 3.2.0 
software (Illumina). The background normalization and differential methylation 
analyses were performed against fetal cerebella using the custom error model 
(Illumina). Cancer-specific DNA hypermethylation events were defined as those 
with a 30% increase in methylation in at least one medulloblastoma sample relative 
to an average methylation level (less than 50%) in normal fetal and adult cerebellum 
samples. Unsupervised clustering using Euclidian hierarchical clustering metrics 
was then performed on 2,503 data points that were filtered for cancer-specific 
hypermethylation events. The CpG methylation data are available from the Gene 
Expression Omnibus under accession number GSE34356. 

Bisulphite sequencing of CpG promoter methylation. Representative examples 
of primary-tumour- and metastasis-specific methylation events were identified 
from normalized Illumina Hg27 data. Bisulphite PCR (BSP) primers were 
designed using the EpiDesigner tool (SEQUENOM) (http://www.epidesigner. 
com/) to encompass a genomic region flanking the Illumina Hg27 gene-specific 
probe. DNA (500ng) from the primary tumour and the corresponding metastases 
was bisulphite converted using an EZ DNA Methylation Kit. Following PCR 
optimization, 10ng bisulphite-converted DNA was used to amplify the genomic 
regions of interest. Amplicons were subcloned into the pCR2.1-TOPO vector 
(Invitrogen), and plasmid DNA from 10-12 colonies was extracted using a 
PureLink Quick Plasmid Miniprep Kit (Invitrogen). Sequencing was performed 
at TCAG using the M13 reverse primer, 5'-CAGGAAACAGCTATGAC-3’. The 
following primers were also used: MLH1 Fw, 5'-TTGTTGGAATGTTATTTAT 
TATTTAGGA; MLHI1 Rv, 5’-CATAATATCCACCAAAAAACCAAAA-3’; 
MRPS21 Fw, 5'-TTTTTGGTTTTTGTTGATTGTTTTT-3’; MRPS21 Rv, 5’-CAA 
ATCTCAAAAAATCTATCCTTTCC-3’; RBP1 Fw, 5'-GTAGGGGAGGTATAG 
GTAGGTTGTG-3’; RBP1 Rv, 5'-CTTAATCAAACCCCCTAAACAAAAA-3’; 
WNK2 Fw, 5’-GTGTTTTTGGTTTATAGAGATGGA-3’; and WNK2 Rv, 5’-AC 
TCCTCCTAATCCRACTCTAC-3’. Details about the primer design and PCR 
protocols are available on request. 

Alignment and variant calling for whole-exome sequencing. Standard manu- 
facturers’ protocols were used to perform target capture with a TruSeq Exome 
Enrichment Kit (Illumina) and sequencing of 100-bp paired-end reads on a HiSeq 
sequencing system (Illumina). Approximately 10 gigabases of sequence was 
generated for each subject such that >90% of the coding bases of the exome 
defined by the Consensus CDS (CCDS) project were covered by at least ten reads. 
Adaptor sequences and quality trimmed reads were removed by using the FASTX- 
Toolkit (http://hannonlab.cshl.edu/fastx_toolkit/), and then a custom script was 
used to ensure that only read pairs with both mates present were subsequently 
used. Reads were aligned to Hg19 with BWA1, and duplicate reads were marked 
using Picard (http://picard.sourceforge.net/) and excluded from downstream ana- 
lyses. SNVs and short insertions and deletions (indels) were called using SAMtools 
(http://samtools.sourceforge.net/) Pileup and varFilter2 with the base alignment 
quality (BAQ) adjustment disabled and were quality filtered to require at least 20% 
of reads supporting the variant call. Variants were annotated using both 
ANNOVAR3 and custom scripts to identify whether they affected protein coding 
sequence and whether they had previously been seen in dbSNP131, the 1,000 
Genomes pilot release (November 2010) or in approximately 160 exomes that 
had previously been sequenced at our centre. 

SNV analysis of whole-exome sequencing data. For clustering analysis, an SNV 
frequency matrix was constructed by calculating frequencies from the read counts of 
the reference and the alternative nucleotide. The matrix was not standardized (that is, 
converted to z scores) before clustering, because the absolute SNV frequencies were 
of interest. 
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For Venn analysis, the samples were grouped into primary—metastasis sets, and the 

filtered SNVs were used to identify SNVs that are enriched in one sample compared 
with all other samples of the same set, as determined by the hypergeometric test (P 
value threshold = 0.05). For sets consisting of three or more samples (A, B and C), an 
SNV was considered to be enriched in samples A and B if the SNV was enriched in A 
compared with C alone and also enriched in B compared with C alone. SNVs that 
were not enriched in any sample or subset of samples were considered to be common 
SNVs. Many of these common SNVs probably represented germline SNVs specific to 
the patient. 
Analysis of CpG promoter methylation data. The similarities between the 
patient-matched metastatic and primary tumour samples and among patient- 
matched metastatic tumour samples were determined by using Pearson correla- 
tion analysis. As Pearson’s r values are not normally distributed, they were 
standardized by Fisher’s z transformation. Subsequently, the correlations between 
the metastatic samples and the matched primary tumour samples were compared 
with the correlations among the patient-matched metastatic samples, using the 
paired heteroscedastic Student’s t-test. 

Clustering analysis was performed as described above. The methylation matrix 
was not standardized before clustering, as doing so would entail discarding crucial 
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information on the differences in the overall methylation profiles among samples 
or the average methylation among CpG promoters. 

The stability of the CpG hypermethylation profile clusters was assessed using 
three methods. First, the clustering analysis was run for different numbers of CpG 
hypermethylation sites that varied most widely among samples. The partitions 
generated by each clustering run were compared with the reference partitions 
generated by the original clustering based on the 1,000 most variable hypermethy- 
lated CpG islands using the Jaccard similarity index. The same analysis was 
applied to a set of 100 background hypermethylation data matrices in which the 
sites are permuted independently in each sample. Second, the clustering analysis 
was performed for random subsamples of 1,000 sites, for 1,000 repeat runs. In each 
run, the resultant cluster was compared with the original cluster using the Jaccard 
index. Analysis on the original data matrix was compared with a set of 100 
background matrices, permuted as described above. Third, the cluster stability 
was further assessed by bootstrap resampling of the samples using the pvclust R 
package (version 1.2). 


28. Gnekow, A. K. Recommendations of the brain tumor subcommittee for the 
reporting of trials. Med. Pediatr. Oncol. 24, 104-108 (1995). 
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DCC constrains tumour progression via its 
dependence receptor activity 


Marie Castets!*, Laura Broutier!*, Yann Molin', Marie Brevet*, Guillaume Chazot', Nicolas Gadot’, Armelle Paquet”, 
Laetitia Mazelin', Loraine Jarrosson-Wuilleme’, Jean-Yves Scoazec’, Agnés Bernet! & Patrick Mehlen! 


The role of deleted in colorectal carcinoma (DCC) as a tumour 
suppressor has been a matter of debate for the past 15 years. 
DCC gene expression is lost or markedly reduced in the majority 
of advanced colorectal cancers’ and, by functioning as a depend- 
ence receptor, DCC has been shown to induce apoptosis unless 
engaged by its ligand, netrin-1 (ref. 2). However, so far no animal 
model has supported the view that the DCC loss-of-function is 
causally implicated as predisposing to aggressive cancer develop- 
ment*. To investigate the role of DCC-induced apoptosis in the 
control of tumour progression, here we created a mouse model 
in which the pro-apoptotic activity of DCC is genetically silenced. 
Although the loss of DCC-induced apoptosis in this mouse model 
is not associated with a major disorganization of the intestines, it 
leads to spontaneous intestinal neoplasia at a relatively low fre- 
quency. Loss of DCC-induced apoptosis is also associated with 
an increase in the number and aggressiveness of intestinal tumours 
in a predisposing APC mutant context, resulting in the develop- 
ment of highly invasive adenocarcinomas. These results demon- 
strate that DCC functions as a tumour suppressor via its ability to 
trigger tumour cell apoptosis. 

The development of colonic carcinoma from normal colonic 
epithelium has been shown to be associated with the mutation of a 
specific set of genes*. Loss of heterozygosity on chromosome 18q in 
more than 70% of primary colorectal tumours prompted the search for 
a tumour suppressor gene at that locus. This search led to the cloning 
of a putative cell-surface receptor, DCC’. DCC expression is markedly 
reduced in more than 50% of colorectal tumours, as well as in many 
other neoplasms (for a review, see ref. 5). Moreover, loss of DCC is 
associated with poor prognosis and potentially decreased response to 
adjuvant chemotherapy in colorectal cancer patients. Lastly, restora- 
tion of DCC expression can suppress tumorigenic growth properties in 
vitro and in nude mice’. Altogether, these data led to the proposal that 
DCC expression is a constraint for tumour progression, and thus that 
DCC functions as a tumour suppressor gene. However, a major con- 
troversy resulted from this proposal, because the localization of the 
DCC gene close to well-established tumour suppressors such as Smad4 
(ref. 6) and the absence of increased tumour susceptibility in a mouse 
model in which Dcc was mutated’ created scepticism about its poten- 
tial role as a tumour suppressor gene’. 

It has been shown that DCC belongs to the family of dependence 
receptors”. Such receptors induce apoptosis when their trophic ligands 
are absent, thus conferring a state of cellular dependence on ligand 
availability for survival’. On the basis of this classification, DCC may 
represent not a classical tumour suppressor but rather a conditional 
tumour suppressor, inducing the death of tumour cells in settings of 
ligand limitation, thus preventing invasion and metastasis, but failing 
to suppress tumour formation (and potentially supporting tumour 
progression) in settings of high ligand concentration. In support of 


this view, overexpression of the DCC ligand netrin-1 in the digestive 
tract has been shown to result in the inhibition of epithelial cell death 
and the promotion of tumour progression’. 

The pro-apoptotic signalling induced by unbound DCC requires 
its intracellular domain cleavage by caspase after aspartic acid 1290 
(refs 2, 10; Fig. la). Point mutation of the aspartic acid residue 
(D1290N) did not affect the positive signalling mediated by netrin-1 treat- 
ment (Fig. 1b). However, compared to wild-type DCC, DCC(D1290N) 
failed to trigger apoptosis in cell culture (Fig. 1c-g). To address formally 
the role of DCC as a tumour suppressor and to determine the relative 
importance of DCC pro-apoptotic activity in its putative tumour sup- 
pressor activity, we generated a mouse model with the D1290N point 
mutation in the DCC coding sequence (Fig. 1h-j). Contrary to DCC 
homozygous null mutants, which die at birth with many nervous 
system defects’, mice bearing one (DCC*/™) or two mutated alleles 
(pCc™”™*) were viable. The fact that these mice do not show obvious 
defects in the brain further supports the view that the D1290N mutation 
does not inhibit or enhance DCC ‘positive’ signalling, which has been 
shown to be required for adequate neuronal guidance'’’*. We thus 
investigated whether this mouse model showed a loss of DCC-induced 
apoptosis. Murine embryonic fibroblasts (MEFs) were cultured 
from DCC*’* or DCC™™ embryos (Fig. 2a). As predicted by the 
dependence receptor paradigm, MEF cells expressing a wild-type DCC 
underwent apoptosis in response to serum and netrin-1 deprivation, 
whereas the addition of netrin-1 delayed MEF cell death (Fig. 2b). In the 
same settings, MEF homozygous for the DCC mutation were less 
sensitive to serum withdrawal, and netrin-1 addition failed to augment 
survival (Fig. 2b). Thus, in this mouse model, cells expressing DCC 
should not undergo apoptosis in settings of netrin-1 limitation. 

In the intestine, because of the restricted expression of netrin-1 at 
the bases of the intestinal villi? (Fig. 2c, d and Supplementary Fig. 1a), 
in contrast to the uniform expression of DCC along the villi’ (Fig. 2e, f 
and Supplementary Fig. 1a), we hypothesized that epithelial cell death, 
which is generally observed at the tips of the villi, could result at least in 
part from unbound DCC-induced apoptosis. As shown in Fig. 2h-j 
and Supplementary Fig. 1b, apoptosis in the intestinal epithelium of 
Dcc™™" mice was significantly decreased compared to DCC*!* 
mice, although no change was observed in cell proliferation (Fig. 2g) or 
differentiation (data not shown). These findings, coupled with the 
previous demonstration that, in mice, ectopic expression of netrin-1 
reduces intestinal cell death, whereas netrin-1 hypomorphic mutant 
newborn mice show increased cell death’, suggest that DCC and 
netrin-1 may be homeostatic regulators of intestinal epithelial turnover 
via a dependence receptor mechanism (Supplementary Fig. 1 and 
Supplementary Fig. 1c). It is noteworthy that, even though DCC muta- 
tion was shown to decrease intestinal cell death at the tips of the villi, 
global disorganization of the intestinal epithelium of these animals 
was not observed, probably because intestinal cell apoptosis levels 
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Figure 1 | Establishment of a mouse model with a mutation of the caspase 
cleavage site in DCC. a, Without netrin-1, DCC induces apoptosis unless 
mutated in D1290. b, In HEK293T cells, netrin-1-induced DCC-mediated ERK1/ 


DCC 


2 phosphorylation is not affected by D1290N point mutation. c-f, TdT-mediated 
dUTP nick end labelling (TUNEL) assay on DCC or DCC(D1290N) transfected 


HEK293T cells (mean + s.e.m., n = 3). *P < 0.005, U-test. Representative images 
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are shown. Ctl, control. g, Caspase-3 activity on DCC or DCC(D1290N) 
transfected HEK293T cells (mean + s.e.m., n = 3). *P < 0.005, U-test. Inset: 
DCC immunoblot. h, Mutant mouse model generated by introduction of the 
D1290N point mutation in exon 26 (e26) of Dcc. Arrows indicate Cre 
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Figure 2 | Mutation of DCC caspase cleavage site is associated with reduced 
apoptosis in mice. a, DCC expression (qRT-PCR) in DCC*’* and 
Dcc™"' MEBs from embryonic day (E)13.5 embryos. b, Caspase-3 activity 
in DCC'’* and DCC™™”™™ MEFs after serum deprivation and netrin-1 
treatment (mean + s.e.m., n = 3). *P < 0.02, U-test. 

c-f, Immunohistochemistry of netrin-1 (¢c, d) or DCC (e, f) in proximal 
intestine of DCC*!* (c, e) and DCC™’™™ (d, f) mice. Right panels, 
enlargement of villi and crypt staining. g, Cell proliferation in intestinal crypt 
analysed by anti-Ki67 staining. h, Intestinal cell death in wild-type or DCC 


k 
Percentage of mice 
Genotype with tumours 
DCCt+ 0 
DCCmut/mut 14.8 


mutant mice. *P < 0.001, U-test. Representative images of pyknotic cells from 
haematoxylin-eosin-saffron staining of DCC*’* and DCC™”™" mice are 
shown. i, j, TUNEL staining of intestinal villi. No difference in apoptosis rate 
was observed between control and mutant animals in intestinal crypt (data not 
shown). k, Incidence of spontaneous tumour formation in intestines of 
Dcc™’™* (n = 28) compared to DCC*!* (n = 18) mice. Haematoxylin- 
eosin-saffron staining of an adenoma (left panel; arrow) and of an 
adenocarcinoma (right panel) observed in pcc™’/™ mice. m, mucosa; 

M, muscularis. 
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throughout the villi remain quite low under physiological condi- 
tions'’*"*. Thus, netrin-1 regulation of DCC-induced cell death is 
unlikely to be solely responsible for a general homeostatic regulatory 
process that would balance intestinal epithelial cell proliferation and 
differentiation; rather, it is more likely to limit the lifespan of cells 
arriving at the tips of the villi and, as such, to serve as a mechanism 
that limits the occurrence of genetic alterations. In support of this 
possibility, reduction of apoptosis in DCC™”™" mice was accompanied 
by spontaneous, albeit limited, intestinal tumour formation: whereas no 
tumours were detected in wild-type control littermates, 14.8% of mutant 
mice showed spontaneous neoplastic transformation, including both 
adenomas and adenocarcinomas (Fig. 2k; P< 0.05). Because intestinal 
cells are known to undergo multiple proliferative steps within the crypt, 
and repeated mechanical and chemical insults originate from the 
intestinal lumen, we propose that DCC-mediated cell death may rep- 
resent a factor for limiting the initiation of malignant transformation. 

In human pathological samples, loss of DCC is observed with especially 
high frequency in late-stage tumours. Together with the fact that spon- 
taneous neoplastic transformation was observed only at low frequency 
in these DCC mutant mice, this suggested that the putative tumour 
suppressive role of DCC may modulate a predominantly late event in 
tumorigenesis. We investigated the possibility that DCC pro-apoptotic 
activity may affect tumour progression by analysing the effect of 
the DCC(D1290N) mutation on adenocarcinoma formation in an 
APC*!18N genetic background. APC is a well-known tumour sup- 
pressor gene in human colorectal cancer, and APC mutations in mice 
are associated with neoplasm formation. We chose the APC/16°8N 
mutant mice, which were shown to develop tumours in the intestinal 
tract at a moderate level’. Consistent with a previous report, the number 
of adenocarcinomatous lesions per DCC*’* APC*/!™®N control mice 
was 0.79 + 0.77 (Fig. 3a)'*. The incidence of adenocarcinomas was 


increased by more than 2.5-fold in DCC™™”™ APC*'®8N mice, 
(Fig. 3a; P=0.0002). Moreover, whereas 21.4% of pcc*!* 
APC*!!68N control mice were tumour free (Fig. 3b), all Decne 
APCT18N mice had at least one neoplasm. Lastly, whereas 
adenocarcinomas were detected in 50% of the DCC*/* APC*/18N 
control mice, consistent with previous reports’, the frequency of 
pco™!™*APC*/168N mice with adenocarcinomas was markedly 
increased to 100% (Fig. 3a; P = 0.003). Mice heterozygous for the 
DCC mutation in the APC*’"®*8N background showed an intermediate 
phenotype, with a significantly increased number of adenocarcinomas 
compared to APC*''®*8N mice (Supplementary Fig. 2). 

Of interest, 45.5% of DCC™”™*ApCc'!®8N mice showed 
aggressive adenocarcinomas with serosal invasion, compared to 8% 
in controls (Fig. 3c, d; P<0.04). Because serosal invasion in 
Dpece™/™tapct68N mice supports the view of tumour cells 
spreading in these mutant mice, we analysed distant organs for 
metastasis. None of the mice showed macrometastatic lesions in the 
peritoneum, liver or lung. However, highly proliferative micro- 
metastases were observed in the livers of DCC™™”"™' APC */18N mice 
with adenocarcinoma with serosal invasion, but not in controls (Sup- 
plementary Fig. 3). 

Together with the finding that, in human tumours, DCC is typically 
deleted in late-stage tumours, this markedly increased aggressivness 
suggests that the loss of DCC may enhance tumour cell survival at the 
transition from adenoma to adenocarcinoma. We therefore assessed 
whether, in accordance with this hypothesis, apoptosis is quantita- 
tively different in low-grade tumours of DCC™”™APCT/°8N 
versus DCC*/* APC*/"®8N control mice. As shown in Fig. 3e and f, 
we observed a marked decrease in apoptosis rate in adenomas from 
pec™’/™tapct/168N mice compared to that in size-matched 
pcc*’*Apct®8N controls, while both tumours persistently 


a 
Number of ADK Percentage of mice 
Genotype per mouse with ADK 
APC#/1638N ECC #/* 0.79 + 0.77 50.0% 
APC+/1638N pccmut/mut 2.36 + 1.00* 100%*™* 
c 
f 
e 


Figure 3 | Inactivation of DCC-induced apoptosis favours adenocarcinoma 
formation in an APC*/"®® mutant background. a, Incidence and frequency 
of adenocarcinomas (ADK) in DCC™”™ APCT/188N mice (n = 11) 
compared to DCC*/* APC*®N mice (n = 29) (*P = 0.0002, t-test; 

**P = 0.003, Fisher’s test). Tumour classification was performed according to 
international recommendations’*; pseudo-invasion was ruled out. 

b-e, Haematoxylin-eosin-saffron staining of normal intestinal epithelium 
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(b) compared to adenocarcinomas with muscularis (c) or serosa local invasion 
(d) observed in DCC™!™ APC*/18N mice. m, mucosa; M, muscularis; S, 
serosa. e, f, Apoptosis in size-matched adenomas from DEG 4 PCT een 
mice compared to pcct!* APCt/68N mice. e, Haematoxylin-eosin-saffron 
staining. f, Apoptosis was quantified in adenomas from three mice of each 
genotype (control: 0.28%; mutant: 0.058%). *P = 0.003, U-test. 
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showed DCC expression (data not shown). Thus, the inhibition of 
DCC-induced apoptosis may modulate, in low-grade tumours, the 
balance between proliferation and death, favouring proliferation. 
Consequently, this may increase the likelihood of occurrence of 
additional genetic or epigenetic alterations, thus enhancing tumour 
progression. 

The current results provide a definitive demonstration that DCC is a 
bona fide tumour suppressor in the intestinal tract. Our results demon- 
strate the importance of DCC as a late gatekeeper, which limits tumour 
progression. Moreover, because the D1290N mutation inhibits DCC- 
induced apoptosis but not netrin-1-dependent signalling, this DCC 
tumour suppressive activity probably occurs via the ability of DCC to 
trigger apoptosis of neoplastic or pre-neoplastic cells in settings of 
netrin-1 limitation. DCC expression is not only decreased or lost in 
colorectal cancer but ina large variety of cancers such as prostate, breast, 
endometrial, ovarian, oesophageal, testicular, glial, neuroblastoma and 
hematological malignancies’. It will therefore be of interest to analyse, 
using transgenic mouse models, the tumour suppressor role of DCC in 
these malignancies. In light of our data, the function of DCC as a 
dependence receptor and a conditional tumour suppressor seems to 
represent an important safeguard mechanism, limiting tumour pro- 
gression by engaging the apoptotic process. 


METHODS SUMMARY 

Further details about materials and methods are provided in Methods. Briefly, the 
DCC(D1290N) targeting vector was constructed using a fragment of Dcc gene 
encompassing 7.9 kb around exon 26. Embryonic stem cell electroporation, selec- 
tion and culture, as well as generation of chimaeric mice and Southern blot analysis 
were performed in the Institut de la Clinique de la Souris (ICS) according to classical 
procedures. Germline transmission and genotyping were detected by Southern blot 
and PCR analysis of tail genomic DNA. APC*/'®®N mice were obtained from 
R. Fodde. Tumour analysis was performed in blind from haematoxylin-eosin- 
saffron stained sections. Apoptosis was quantified in blind on haematoxylin- 
eosin-saffron stained intestine and adenoma sections of controls and mutant 
DCC mice or after TUNEL staining. Quantitative polymerase chain reaction with 
reverse transcription (qRT-PCR), immunohistochemistry, immunoblots, MEF 
culture and cell death assays were performed as described in Methods. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 

Cell culture. Human embryonic kidney HEK293T cells were grown in DMEM 
(Invitrogen), supplemented with 10% fetal bovine serum (FBS; Cambrex). 

Cell death assays. For cell death assays, 1 X 10° HEK293T cells were transfected 
with 2 ug of plasmids constructs (p-DCC-CMV-S, p-DCC-D1290N-CMV-S or 
pCMV control) using calcium phosphate, as described previously’. Transfected 
cells were serum deprived for 24h. The caspase-3 activity assay was performed 
using Caspase 3/CPP32 Fluorimetric Assay Kit, according to manufacturer’s 
instructions (Gentaur Biovision). For detection of DNA fragmentation, treated 
cells were cytospun 48h after transfection, fixed and permeabilized (4% 
paraformaldehyde, PBS1x/Triton 0.2%) and TUNEL immunostaining was per- 
formed with 300 U ml! TUNEL enzyme and 6 uM biotinylated dUTP (Roche 
Diagnostics), as previously described’’. The extremities of the biotinylated DNA 
were revealed using Cy-3-coupled streptavidine (Jackson Immunoresearch) at a 
dilution range of 1:1,000. TUNEL-positive cells and nuclei are then respectively 
stained in red (Cy3) and blue (Hoechst). TUNEL staining of intestinal villi was 
performed on 4-um thick sections of DCC‘/* and DCC™”"" intestines fixed in 
formalin and paraffin embedded according to the same procedure. 

Pyknotic cells show retracted and hyperchromatic nuclei. Quantification of 
pyknotic cells at the top of villi was performed in blind on ten different fields of 
proximal intestine sections stained with haematoxylin-eosin-saffron from two 
different mice of each genotype (Dcc*’* and Dcc™™"), Quantification of 
apoptosis in size-matched adenomas from DCC*/* and DCC™”™"" mice was 
performed in blind according to standard morphological criteria, including evid- 
ence of nuclear and cytoplasmic alterations. 

Immunoblot analysis. Induction of ERK1/2 phosphorylation by netrin-1 was 
achieved as described previously”. In brief, 1 X 10° HEK293T cells were trans- 
fected with 12,1g of plasmids constructs (p-DCC-CMV-S, p-DCC-D1290N- 
CMV-S or pCMV control) using calcium phosphate, as described previously’, 
serum starved for 24h and treated with 150ng ml”! netrin-1 (AG-40B-0075, 
Adipogen) for 15 min. Immunoblots were performed as already described using 
anti-DCC (1/1,000, Pharmingen), anti-ERK1/2 (1/1,000, Sigma), anti-phospho- 
ERK1/2 (1/1,000, Cell Signaling) primary antibodies. 

Generation and analysis of mice with DCC(D1290N) mutation. The 
DCC(D1290N) targeting vector was constructed using a fragment of Dcc gene 
encompassing 7.9 kb around exon 26. Embryonic stem cell electroporation, selec- 
tion and culture, as well as generation of chimaeric mice and Southern blot analysis 
were performed in the Institut de la Clinique de la Souris (ICS) according to 
classical procedures. APC*/!®®N mice (in C57BL/6 background; a gift from 
R. Fodde) were mated with mice homozygous for the DCC mutation in a pre- 
dominant C57BL/6 background. Double heterozygous DCC™* APC*/1°°8N 
mice of the offspring were interbred to generate mice homozygous for the DCC 
mutation and heterozygous for APC. Routine genotype analysis of mice was 


performed by PCR assay on DNA purified from tail biopsies (Extract N-Amp 
Tissue PCR kit, Sigma Aldrich). Wild-type and D1290N mutant alleles were 
distinguished using primers (CTGGAAACTTCCTTCTTGCTGGAGAAC/ 
CTGGTTATGGGGACAGAGAGTGC) localized around the residual Lox P site. 
All experiments were performed in accordance with the relevant guidelines and 
regulations of the animal ethics committee (Authorization no. CLB-2012-024; 
accreditation of laboratory animal care by CECCAPP, ENS Lyon-PBES). 
Quantitative RT-PCR. Total mRNAs were extracted from tissues or MEF cells 
using Nucleospin RNAII kit (Macherey-Nagel) and 1 jig was reverse transcribed 
using the iScript cDNA Synthesis kit (Bio-Rad). Real-time quantitative RT-PCR 
was performed on a LightCycler 2.0 apparatus (Roche) using the Light Cycler 
FastStart DNA Master SYBERGreen I kit (Roche). Oligonucleotide sequences are 
available on request. 

MEFs. MEFs were prepared from individual embryos at E13.5 bearing DC 
and DCC™™t genotypes. In brief, the head and internal organs were removed, 
and the remaining tissue was minced and dispersed in 0.1% trypsin. Cells were 
grown in DMEM (Invitrogen), supplemented with 10% fetal bovine serum (FBS; 
Cambrex) for two populations doubling and frozen. For the caspase-3 assay, 
1.5 10° DCC*’* and Dcc™”" cells were plated with or without serum and 
treated or not with 150 ng ml! netrin-1 (AG-40B-0075, Adipogen). The caspase 
assay was performed 6h after as described above. 

Immunohistochemistry. Immunohistochemistry on 4-pm-thick sections of 
intestines from 30-week-old DCC*/* and DCC™'”™ mice was performed as 
described previously’, using anti-netrin-1 (1/2,000, Ab-2, Calbiochem), anti- DCC 
(1/20, A-20, Santa-Cruz) and anti-Ki67 (1/150, Dako). Netrin-1, DCC and Ki-67 
expression are revealed by brown DAB staining. Nuclei are counterstained by 
haematoxylin (blue). 

Tumour analysis. For analysis of spontaneous neoplasia occurrence, pec** 
and DCC™”"™ mice were euthanized at 18 months. Tumours were detected in 
different places of the intestines including the colon. DCC*/* APC*/1°8%, 
poct/™ APCT/168N and DCC™!™* APC*68N mice were euthanized at 
30 weeks. 

In all cases, intestines were removed and examined for the presence of neoplasia. 
Tumours were resected, formalin fixed and paraffin embedded. 4-j1m-thick sections 
were stained with haematoxylin-eosin-saffron. Histological classification and grading 
of neoplastic lesions was performed in a blinded fashion and according to standard 
procedures’*"*, 
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Deleted in colorectal carcinoma suppresses 
metastasis in p53-deficient mammary tumours 


Paul Krimpenfort', Ji-Ying Song’, Natalie Proost!, John Zevenhoven!, Jos Jonkers* & Anton Berns>* 


Since its discovery in the early 1990s the deleted in colorectal cancer 
(DCC) gene, located on chromosome 18q21, has been proposed as 
a tumour suppressor gene as its loss is implicated in the majority of 
advanced colorectal and many other cancers’. DCC belongs to the 
family of netrin 1 receptors, which function as dependence receptors 
as they control survival or apoptosis depending on ligand binding. 
However, the role of DCC as a tumour suppressor remains contro- 
versial because of the rarity of DCC-specific mutations and the pres- 
ence of other tumour suppressor genes in the same chromosomal 
region. Here we show that in a mouse model of mammary carcinoma 
based on somatic inactivation of p53, additional loss of DCC pro- 
motes metastasis formation without affecting the primary tumour 
phenotype. Furthermore, we demonstrate that in cell cultures 
derived from p53-deficient mouse mammary tumours DCC expres- 
sion controls netrin-1-dependent cell survival, providing a mech- 
anistic basis for the enhanced metastatic capacity of tumour cells 
lacking DCC. Consistent with this idea, in vivo tumour-cell survival 
is enhanced by DCC loss. Together, our data support the function of 
DCC as a context-dependent tumour suppressor that limits survival 
of disseminated tumour cells. 

In the developing nervous system, netrin 1 receptors, DCC and 
UNCSH, regulate axon guidance by mediating chemo-repulsion and 
attraction as a result of ligand interaction” ~*. These receptors also act as 
dependence receptors inducing cell survival in the presence of their 
ligand netrin 1 or cell death in the absence of netrin 1 (refs 5, 6). 
Consequently, netrin 1 overexpression or loss of expression of its 
receptors confers a selective advantage to (tumour) cells. Over- 
expression of netrin 1 has been implicated in lung cancers’ and in 
metastatic breast cancer®, and enforced netrin 1 expression in the 
mouse gastrointestinal tract contributes to adenocarcinoma forma- 
tion’. Downregulation of netrin 1 receptors has been reported in the 
progression of multiple cancers including colorectal, breast, ovary, 
stomach, lung and kidney cancers'’, and is associated with loss of 
heterozygosity or epigenetic silencing'’. However, although specific 
tumour-related misssense mutations in coding exons have been 
reported for UNCS5H (ref. 12), inactivating mutations were not found 
in DCC (refs 13, 14). The lack of definitive support, the possible impact 
of loss of heterozygosity or epigenetic silencing on expression of other 
tumour suppressor genes in the same chromosomal region and the 
absence of a cancer phenotype of Dcc-deficient mice'* have raised 
doubts about whether DCC has a tumour suppressor function’®. 

To directly address the role of DCC loss in tumour progression we 
introduced a Cre/loxP-technology-based conditional mutant Dec allele 
into a well-defined mouse model for mammary carcinoma. In this 
model cytokeratin 14 (K14) promoter-driven Cre expression ablates 
both p53 alleles in mammary epithelial cells, resulting in mammary 
tumours'”'*. We have chosen this model to study the effect of DCC 
loss for two reasons. First, the expression of UncSh genes is positively 
regulated by p53 (refs 19, 20, 21) and consequently, under p53- 
deficient conditions any effect of DCC loss could no longer be masked 


by Unc5 expression. Second, mammary tumours arising in the K14- 
cre p53'”" model are well-encapsulated and only rarely metastasize!”"’, 
allowing in vivo assessment of the effects of additional mutations on 
tumour invasion and metastasis”. 

Dec is a large gene stretching over 1.2 Mb and is located on human 
and mouse chromosome 18 (ref. 14). To generate a conditional knock- 
out for Dec (Dcc*) we inserted loxP sites in introns 22 and 23 by 
homologous recombination in embryonic stem cells, enabling the 
Cre-mediated excision of exon 23 encoding the transmembrane (Sup- 
plementary Fig. 1a). Details of the Dec allele are shown in Sup- 
plementary Fig. 2 and in the Methods. Homozygous Dcc’”" mice are 
born in normal Mendelian ratios and do not show any aberrancies. 
Previously generated Dcc-deficient mice showed defects in axonal 
projections in the central nervous system and died shortly after birth’’. 
To confirm that Cre-mediated deletion of exon 23 results in a non- 
functional Dec allele we analysed the phenotype of mice carrying the 
recombined Dec allele (Dec?) obtained from crosses of Dec’ mice 
with germline-expressing Cre mice. All homozygous Dec???” mice 
died within 1 day of birth and we noticed defects in the central nervous 
system that were very similar to those that have been described for the 
Dec knockout mice", for example, all Dec??*””” pups showed complete 
absence of the corpus callosum (Supplementary Fig. 1b) and immuno- 
staining showed the loss of Dcc expression in the cerebral cortex 
(Supplementary Fig. 1c). We then generated cohorts of female K14- 
cre Dec'’* p53’ and K14-cre Dec’ p53" mice and compared 
mammary tumour development with respect to latency, histopathology 
and progression in these mice to a cohort of female K14-cre p53’ mice. 
In all three cohorts a high incidence of mammary neoplasias was 
observed (80%), with similar latency as proved by the complete over- 
lapping of Kaplan-Meier curves of mammary-tumour-free survival 
(Fig. 1a). The lesions were found in all the mammary glands, either as 
a single tumour or as multiple independent tumours. Macroscopically, 
they were solid with clear demarcations. Microscopically, a comparable 
diversity of tumour cell properties was observed in the three cohorts. 
Tumours were classified as moderately differentiated adenocarcinomas 
and/or carcinosarcomas. Ductal and/or solid nests and trabecular 
structures could be seen in the carcinomatous compartment, whereas 
sarcomatoid lesions were mainly composed of spindle cells that were 
arranged in bundles and interlacing structures (Fig. 2 and Supplemen- 
tary Fig. 3). Immunohistochemical characterization of the tumours 
from the three cohorts did not show significant differences in marker 
profile using antibodies against epithelial (CK8, CK14, E-cadherin) and 
mesenchymal markers (vimentin, smooth muscle actin) or in apoptotic 
or proliferative activity (Supplementary Fig. 4). An overview of the 
mammary tumours that were observed in the three cohorts is presented 
in Table 1. 

However, whereas the primary mammary tumour phenotypes were 
similar in all three groups, metastases in K14-cre Dec’” p53" mice were 
much more frequent (6 out of 14) than in K14-cre Dect’* pos” F and 
K14-cre p53" ¥ cohorts (in total, 1 out of 21 (P< 0.01; Table 1)). These 
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Figure 1 | DCC loss does not affect latency of p53-deficient mammary 
tumour development. a, Tumour-free survival curves of K14-cre p53" WF K14- 
cre Dec’ * p53" and K14-cre Dec” p53"* mice. b, Tumour-free survival of 
mice with and without metastases. c, Southern blot analysis of tumours from 
K14-cre Dec’ p53" mice showing Cre-mediated recombination of p53" and 
Dec" alleles. Red asterisk, lanes with metastatic tumours. Numbers indicate 
different mice. 


metastases were found in draining lymph nodes and/or lung. They 
showed a similar immunohistochemical staining pattern to their cor- 
responding primary tumours (Fig. 2 and Supplementary Fig. 5). The 
latency of metastasizing tumours did not differ from that of the non- 
metastasizing tumours (Fig. 1b) and both primary carcinoma and 
carcinosarcoma could give rise to metastasis formation (Supplemen- 
tary Table 1). From four metastatic and six non-metastatic tumours 
from K14-cre Dec” p53" mice we could isolate sufficient DNA for 
Southern blot analysis to estimate Cre-recombination-mediated exon 
23 deletion. We found that the Dec" allele is not a very efficient Cre 
recombinase substrate and the extent of exon 23 deletion varied con- 
siderably (see Fig. 1c). Out of the ten tumours that we were able to 
analyse only six showed widespread deletion of exon 23, and out of 
these six there were four that showed metastasis formation, whereas 
none of the four tumours with only partial exon 23 deletion showed 
metastatic spread (Supplementary Table 1). These data indicate that 
the loss of DCC function per se is not selected for in primary tumour 
development but that its loss facilitates metastasis. 

Dec is expressed at high levels in distinct regions of the brain 
and at much lower levels in various other tissues. Polymerase chain 
reaction with reverse transcription (RT-PCR) analysis showed that 
DCC is expressed in mammary tumours from K14-cre p53"”" mice 
and its expression is lost in many of the primary tumours of K14- 
cre Dec" p53'”" mice. None of these tumours showed features that 
are associated with metastasis, such as loss of E-cadherin expression”. 
We speculated that the increased metastasis formation in the K14- 
#”" p53'" mice might relate to the ability of DCC to trigger 


23,24 


cre Dec 
apoptosis when netrin 1 availability is limited, for example, in specific 
conditions when cells have disseminated from the primary tumour 
mass. To address this issue in an accessible system we used p53- 
deficient (p53? ”P) cell cultures derived from K14-cre p53" “F tumours 
and assayed whether cell survival under apoptosis-inducing conditions 
is dependent on the netrin 1-DCC interaction. RT-PCR showed that 
Dec expression varied between the p53”? tumour cell lines. To study 
the effect of netrin 1 on apoptosis we used three tumour cell lines 
expressing DCC and two with no, or hardly detectable, Dcc expres- 
sion (Fig. 3a). We tested the consequences of serum deprivation, the 
absence of cell-matrix interaction (on non-coated polystyrene dishes) 


LETTER 


Figure 2 | Microphotographs of primary mammary carcinosarcoma (left) 
and metastasis in the lung (right) in serial sections. HE staining of the 
primary carcinosarcoma together with immunohistochemical stains of CK8, 
CK14 and vimentin reveals epithelial as well as mesenchymal differentiations. 
However, the metastasis in the lung mainly shows epithelial properties. Scale 
bars; left column, 50 jim; right column, 20 tum. Asterisk, indication of 
carcinomatoid differentiation of the tumour. 


or both. Apoptosis induction in these cultures was quantified by 
cell-surface phosphatidylserine expression using fluorescein (FITC)- 
conjugated annexin 5. The effect of adding netrin 1 was calculated as a 
percentage reduction in the apoptosis induced by serum deprivation. 
Under adherent culture conditions, with or without serum, 
apoptotic cells were hardly detectable in the cell cultures tested and 
thus a netrin 1 effect on cell survival could not be measured. Apoptosis 
can be induced in p53” tumour cells grown in the absence of cell- 
matrix interaction”, although the extent of apoptosis induction varied 
between the tumour cell lines tested (Fig. 3b and c). In tumour cell lines 
with no—or hardly detectable—DCC expression, serum deprivation 
resulted in a mild increase of apoptosis on which netrin 1 addition had 


Table 1 | Frequent metastasis formation of mammary tumours in the 
K14-cre Dec'”* p53* cohort 


Carcinoma Carcinosarcoma Metastasis 
K14-cre p53" 8/16 (50%) 13/16 (81%) 1/16 (6%) 
K14-creDec”* p53" = 4/7 (57%) 5/7 (71%) 0/7 
Total* 12/23 (52%) 18/23 (78%) 1/23 (4%) 
K14-creDec p53 6/14 (43%) 10/14 (71%) 6/14 (43%)+ 


* Total refers to the sum of the values for K14-cre p53" mice and K14-cre Dec”* p53'" mice. 

+ P<0.05 between the proportion of metastases in K14-cre p53" and that in K14-cre Dec p53" 
mice; P< 0.01 between the proportion of metastases in the sum of K14-cre p53" and K14-cre Deo!“* 
p53" mice and that in K14-cre Dec’ p53" mice. 
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Figure 3| DCC controls apoptosis induction in p53- 

deficient tumour cells in vitro and survival in vivo. a, RT-PCR analysis of Dec 
and Gapdh expression in p53-deficient tumour cell cultures. b, Representative 
FACS analysis (dot plot and histogram) of cells stained with TOPO3 and 
annexin 5 after 24-h culture in non-adherent conditions of p53-deficient 
tumour cultures 3, 6, 4 and 8 in the presence and absence of serum with and 
without the addition of netrin 1. c, Percentage of apoptotic cells 24 h after serum 


virtually no effect. By contrast, serum withdrawal from cultures of 
DCC-expressing p53” tumour cells led to a substantial increase in 
apoptotic cells. This apoptosis could be partly rescued by the addition 
of netrin 1. Depending on the tumour cell culture the rescuing effect of 
netrin 1 varied from 20% to 40% (Fig. 3b and c). These data indicate 
that the netrin 1-DCC interaction is functional in apoptosis regulation 
in mammary tumour cells. Moreover, these observations support our 
premise that the response of p53-deficient cells to netrin 1 relies on 
DCC, as the absence of p53 results in abrogation of expression of other 
netrin 1 receptors. In p53-proficient cells DCC loss would not provide 
substantial protection from apoptosis as the other netrin 1 receptors 
would still convey an apoptotic signal in the absence of netrin 1. To test 
whether DCC loss also confers increased apoptosis resistance or sur- 
vival in vivo we performed intravenous transplantation experiments 
using two cell lines derived from the same primary tumour, one pro- 
ficient in DCC (65-6) and the other deficient in DCC (65-4). We 
confirmed that 65-6 cells are much more sensitive than 65-4 cells to 
apoptosis induction after serum withdrawal. Consistent with the in 
vitro data we found that 5 days after intravenous injection there were 
significantly more tumour cell clusters (P < 0.03) (for a representative 
image see Supplementary Fig. 6) in the lungs of mice transplanted with 
DCC-deficient cells than in the lungs of mice that received DCC- 
proficient cells (Fig. 3d). Moreover, in mice that were injected with 
DCC-proficient cells the lesions showed many more apoptotic bodies. 
It has been shown previously that DCC does not play an important 
part in tumour development’’. However, apart from using a different 
tumour setting they addressed the effect of DCC loss in an Apc™”- 
sensitized but p53-proficient background. Our observations do not 
contradict their observation. Consistent with their results, we show 
that DCC loss is irrelevant for primary tumour development. Dec loss 
does not affect tumour latency or tumour phenotype and it is not 
selected for in primary tumour development. However, we observe a 
significantly enhanced metastatic capacity of p53-deficient tumours 
cells after loss of DCC (P <0.01). In human tumours loss of DCC 
results from genetic and epigenetic events affecting a large region on 
chromosome 18q21, which harbours multiple genes with tumour sup- 
pressor activity, such as SMAD4 and SMAD2 (also known as Jv18). 
Notably, in mice the loss of Smad2 (ref. 25) or Smad4 (ref. 26) leads to 
progression of Apc” -driven intestinal tumours, but without additional 
metastasis formation. Many trivial reasons can underlie the contradic- 
tion between our idea that the effect of DCC loss is p53-dependent and 
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deprivation in cell cultures 3, 4, 6, 7 and 8 in the absence (—) or presence (+) of 
netrin 1; graph represents four independent experiments. d, Quantification of 
numbers of tumour cell clusters (left) and percentage of tumour cell clusters 

showing apoptotic bodies (right), per 5-t1m section, in the lungs of Balb/c nu 
mice (n = 7) 5 days after intravenous injections with 1 x 10° DCC-proficient 
65-6 cells (grey bar) or DCC-deficient 65-4 cells (purple bar). Error bars, s.e.m. 


the classical multistep “Vogelgram’ model for colorectal carcinogenesis 
in which DCC is lost before p53 inactivation”. We favour the scenario 
in which 18q21 deletions, including DCC, occur before p53 loss owing 
to the selective advantage that is conveyed by the loss of the other 
tumour suppressors in this chromosomal region. DCC loss would then 
serve as a passenger mutation that is initially harmless but becomes 
critical after subsequent p53 inactivation as it then promotes the sur- 
vival of cells released from the primary tumour, thereby facilitating 
their colonization of other tissues. In contrast to the small metastases 
we observe in our mouse model, DCC-deficient metastases in patients 
are life threatening. They are composed of cells that have been selected 
for higher malignancy during primary tumour growth. Indeed, loss of 
DCC in breast cancer cells has been associated with worse prognosis 
with a higher risk of recurrent disease”. 


METHODS SUMMARY 


Animal experiments comply with international regulations and ethical guidelines, 
and have been authorized by the experimental animal committee at The Netherlands 
Cancer Institute. 

The Dec" strain was obtained through embryonic-stem-cell targeting. K14-cre 
and p53" strains have been described”. 

For histopathological studies, mice were killed when seriously ill or when tumours 
reached a diameter of 1.5 cm. Tissues were fixed in EAF (ethanol-acetic acid—formol; 
acidified 4% formalin; ethanol/acetic acid/formol/saline at 40:5:10:45 v/v). 
Haematoxylin and eosin (HE) stains were performed according to standard 
procedures. A list of all the antibodies used can be found in the Methods section. 
RNA expression analysis was performed using TaqMan quantitative PCR methods. 
Apotosis in cell cultures was assayed by measuring annexin 5-positive cells by 
fluorescence-activated cell sorting (FACS) as described”. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 


Construction of DCC conditional mice. The strategy to conditionally inactivate 
Dec was based on deleting exon 23, which encodes the DCC transmembrane 
domain using the Cre/loxP technology. With probes for exons 22, 23 and 24 we 
screened a 129/SV phage library (Stratagene) and isolated overlapping phage 
clones spanning a 20-kb region that starts 4kb 5’ of exon 22 and ends 3 kb 3’ of 
exon 24. In the targeting vector we introduced a loxP-Neo-TK-loxP selection 
cassette in the Bln1 restriction site present in the 22nd intron and a single loxP 
sequence in the Csp451 restriction site present in the 23rd intron. This targeting 
vector was electroporated into 129/Ola-derived E14-IB10 embryonic stem cells. 
Correctly targeted neomycin-resistant embryonic-stem-cell clones also harbour- 
ing the single loxP sequence in the Csp451 site were identified by Southern blot 
analysis. In these clones the Neo-TK expression cassette was removed by transient 
Cre expression as described”, resulting in embryonic-stem-cell clones containing 
loxP sequences in introns 22 and 23. Dee'’* embryonic stem cells were injected 
into C57BI/6 blastocysts, and chimaeras were crossed with FVB/N mice to produce 
heterozygous offspring. The resulting Dec'”* heterozygous and subsequent Dec!” 
homozygous mice were viable and fertile, and showed a normal life span, indi- 
cating that the Dec’ allele is fully functional. 

DNA analysis. For genotyping, PCR analysis for the presence of the Dec" allele 
was based on the presence of /oxP (1) inserted in the Bln1 site 5’ in intron 22. 
Forward primer (located 5’ of the 5’ loxP (1) sequence), CAAGACACATG 
GAAGGTGAAATG; reverse primer (located 3’ of the 5’ loxP (1) sequence), 
GACCTCACTTACATATCAAAATGG; the expected fragment size for wild-type 
Dec was 200 bp and for Dec" was 300 bp (carrying the loxP sequence). 

For PCR analysis of the Dec” allele after Cre-mediated recombination (deletion 
of exon 23); forward primer, 5’ loxP (1); reverse primer (located 3’ of loxP (2) in 
the Csp451 site in intron 23); CCCAAATCTTCTATATTACAATATC. The 
expected fragment size was 400 bp. 

Southern blot analysis was used for Cre-recombination-mediated exon 23 dele- 
tion: DNA was digested with BamHI and BglII, and probed with exon 24; analysis 
of p53 inactivation by Cre recombination was performed as described’”. 

Other mouse strains used. Mouse strains carrying conditional alleles for p53 
(p53"), K14-promoter driven Cre recombinase expression and germline expres- 
sion of Cre recombinase have been described". 


RNA expression analysis. Total RNA was isolated using Qiagen RNeasy Mini Kit. 
For subsequent complementary DNA synthesis we used a First Strand cDNA 
Synthesis Kit for RT-PCR (Roche). For Dec we used PCR expression analysis with 
TaqMan probe Mn00514509 and for Gapdh we used TaqMan probe Mn99999915. 
Histological analysis. Tissues were isolated and fixed in EAF saline fixative 
(ethanol/acetic acid/formol/saline at 40:5:10:45 v/v). Tissues were processed routinely 
as for histology purposes. For immunohistochemistry, sections were blocked with 3% 
H,O) for endogenous peroxidases and incubated with primary antibody, and then 
stained with biotin-conjugated secondary antibodies and incubated with horseradish- 
peroxidase-conjugated streptavidin-biotin complex (DAKO). Substrate was 
developed with DAB (DAKO). 

Antibodies. The following antibodies were used: mouse anti-E-cadherin (1:400; 
BD Pharmingen), rat anti-cytokeratin (CK) 8 (1:800; University of Iowa), rabbit 
anti-CK14 (1:10,000; BabCo), mouse anti-SMA (1:10; Zymed), rabbit anti-SMA 
(1:500; Neomarkers), caspase 3 (Cell Signaling; 9661L), KI67 (Monosan; 
PSX1028), DCC (A-20) Santa Cruz (sc-6535). Secondary antibodies were as fol- 
lows: biotin-conjugated anti-mouse, anti-rat and anti-rabbit antibodies (DAKO). 
Derivation and culture tumour cell lines. For the isolation of primary tumour 
cells, a 50-100 mm?’ tumour sample was finely chopped using a Mcllwain tissue 
chopper (Mickle Laboratory Engineering) and digested for 1 h at 37 °C in serum- 
free DMEM-F12 medium (Invitrogen Life Technologies) containing 0.1 mg ml * 
porcine pancreatic trypsin (Difco) and 0.2 mg ml collagenase A (Roche). Cells 
were washed and fibroblasts were allowed to adhere for 1 h at 37 °C. Non-adherent 
epithelial cells were removed and cultured in DMEM-F12 medium containing 
10% fetal bovine serum (FBS; ICN), 100IU ml! penicillin, 100 ug ml! 
streptomycin, 5 ng ml * insulin, 5 ng ml‘ epidermal growth factor (all Invitrogen 
Life Technologies) and 5 ng ml * cholera toxin (Sigma). 293T cells were cultured 
in Iscoves medium (Invitrogen Life Technologies) containing 10% FBS, 
100IU ml’ penicillin and 100 pg ml”! streptomycin. 

Apoptosis assay. Cells were plated at a density of 400,000 cells per well in an ultra- 
low cluster polystyrene culture dish with six wells (Corning). Cells were incubated 
in the presence of either 10% FCS or 0.5% BSA (serum free). Netrin 1 was added at 
a concentration of 100 ng ml !. After 24h, cells were collected and incubated at 
37 °C with 0.25% trypsin (Invitrogen) for 1 min to prevent cell aggregation. FITC- 
conjugated annexin 5 (IQ Products) and ToPro-3 (Molecular Probes) were added 
and annexin-5-positive apoptotic cells were analysed by FACS as described”. 
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The same pocket in menin binds both MLL and JUND 
but has opposite effects on transcription 


Jing Huang’?, Buddha Gurung**, Bingbing Wan!?*, Smita Matkar®, Natalia A. Veniaminova*, Ke Wan’?, Juanita L. Merchant*®, 


Xianxin Hua? & Ming Leib? 


Menin is a tumour suppressor protein whose loss or inactivation 
causes multiple endocrine neoplasia 1 (MEN1), a hereditary auto- 
somal dominant tumour syndrome that is characterized by tumor- 
igenesis in multiple endocrine organs’. Menin interacts with many 
proteins and is involved in a variety of cellular processes” *. Menin 
binds the JUN family transcription factor JUND and inhibits its 
transcriptional activity”’. Several MENI missense mutations dis- 
rupt the menin-JUND interaction, suggesting a correlation between 
the tumour-suppressor function of menin and its suppression of 
JUND. activated transcription”’®. Menin also interacts with mixed 
lineage leukaemia protein 1 (MLL1), a histone H3 lysine 4 methyl- 
transferase, and functions as an oncogenic cofactor to upregulate 
gene transcription and promote MLL1-fusion-protein-induced 
leukaemogenesis*”"””. A recent report on the tethering of MLL1 
to chromatin binding factor lens epithelium-derived growth factor 
(LEDGF) by menin indicates that menin is a molecular adaptor 
coordinating the functions of multiple proteins’’. Despite its 
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Figure 1 | Overview of the human menin-MLL1\gm complex structure. 

a, Isothermal titration calorimetry measurement of the menin-MLL1 pm 
interaction. The inset shows the isothermal titration data. b, Overall structure of 
the menin-MLL1);3m complex. The N-terminal domain is shown in orange, the 
thumb domain in green, the palm domain in blue, the fingers domain in cyan, 


importance, how menin interacts with many distinct partners and 
regulates their functions remains poorly understood. Here we pre- 
sent the crystal structures of human menin in its free form and in 
complexes with MLL1 or with JUND, or with an MLLI-LEDGF 
heterodimer. These structures show that menin contains a deep 
pocket that binds short peptides of MLL1 or JUND in the same 
manner, but that it can have opposite effects on transcription. The 
menin-JUND interaction blocks JUN N-terminal kinase (JNK)- 
mediated JUND phosphorylation and suppresses JUND-induced 
transcription. In contrast, menin promotes gene transcription by 
binding the transcription activator MLL1 through the peptide 
pocket while still interacting with the chromatin-anchoring protein 
LEDGEF at a distinct surface formed by both menin and MLLI. 
The amino-terminal region of MLL1 interacts with menin’*’*”. 
Isothermal titration calorimetry measurements showed that the 
menin-binding motif (residues 6-25) of MLL1 (MLL1)4pm) is necessary 
and sufficient for menin binding (Fig. 1a and Supplementary Fig. la—c). 


Conserved 


Variable 


and loop regions that are disordered or not included in the crystal structure are 
shown as dashed lines. MLL1 pm is shown as a stick model in yellow. c, The 

surface representation of menin indicates that menin adopts a curved left-hand- 
shaped conformation. d, Front view of the menin-MLL1 gm complex, coloured 
according to the degree of amino acid conservation among menin homologues. 
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MLL2, the closest relative of MLL1, contains a sequence that is almost 
identical to MLL1ypm at its N terminus (Supplementary Fig. 1b); 
MLL2,¢6_35 (MLL2);3y;) binds to menin with an affinity that is com- 
parable to that of MLL1 pm (Supplementary Fig. 1d). To understand 
how MLLI and MLL2 (collectively referred to as MLL) are recognized 
by menin, we determined the crystal structures of human menin alone 
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or in complex with MLLIypm (Supplementary Fig. 2, Supplementary 
Table 1 and Supplementary Information). The structure of human 
menin closely resembles a recently published menin homologue struc- 
ture from Nematostella’’. 

The conformation of menin resembles a curved left ‘hand’ with a 
deep pocket formed by its ‘thumb’ and ‘palm’ (Fig. 1b, c). Menin consists 
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Figure 2 | Structural and mutational analyses of the menin-MLL1 gm 
interaction. a, Stereo view of the menin-MLL1 py interface. The 
intermolecular hydrogen bonds are shown as dashed magenta lines. 

b, Phe 9™™" (yellow) is nested in a hydrophobic pocket of menin formed by the 
thumb (green) and the palm (blue). , Electrostatic surface potential of the 
MLL1\ypm-binding cavity of menin (positive potential, blue; negative potential, 
red). d, Co-immunoprecipitation of wild-type (WT) or mutant menin and 


MLL1 


Menin 


H3K4me3 IgG 


MLLI proteins from 293T cells. Arrows and asterisks indicate the positions of 
MYC-MLL1 and immunoglobulin G (IgG), respectively. IB, immunoblot; IN, 
input; IP, immunoprecipitation. e, f, Expression of Hoxc8 (e) and distributions 
of menin, MLL1 and H3K4me3 at the Hoxc8 promoter (f) in Menl-‘~ mouse 
embryonic fibroblasts (MEFs) complemented with control vector, WT or 
mutant menin (n = 6; error bar, standard deviation). 
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of four associated domains: an N-terminal domain characterized by a 
long B-hairpin, a transglutaminase-like domain that forms the thumb, a 
helical palm domain that contains three TPR motifs’” and a carboxy- 
terminal fingers domain (Fig. 1b, c, Supplementary Fig. 3 and Sup- 
plementary Information). Menin is highly conserved across species, 
and the conserved residues are either buried in the hydrophobic core 
or clustered together on a surface patch that covers the thumb and 
palm (Fig. 1d). MEN1 disease-derived missense and in-frame deletion 
mutations are evenly distributed throughout the protein (Supplemen- 
tary Fig. 4), indicating that all four domains are important for the in vivo 
function of menin (Supplementary Fig. 4 and Supplementary Table 2). 

The MLL1ypm peptide adopts a compact conformation and plugs 
into the deep pocket of menin (Fig. 2a and Supplementary Fig. 5). 
Mutagenesis data indicates that MLL]; residues Arg 6-Trp 7-Arg 
8-Phe 9-Pro 10-Ala 11-Arg 12-Pro 13 and their interacting residues in 
menin contribute the most towards the interaction (Supplementary 
Figs 6 and 7, and Supplementary Tables 3 and 4). The side-chain of Phe 
9M" fits into a hydrophobic cavity formed by the thumb and palm of 
menin (Fig. 2b). A menin Met278Trp substitution altered the cavity 
shape and led to complete loss of binding (Supplementary Table 4). The 
MLL1y\spm-binding pocket is highly acidic (Fig. 2c). The two C-terminal 
arginine residues (Arg 24 and Arg 25) in MLL1 gm are disordered, but 
they seem to be important for interaction, given that glutamate sub- 
stitution resulted in a 21-fold decrease in binding affinity (Supplemen- 
tary Table 3). Consistent with this, mutation of the acidic residues of 
menin also led to decreased binding (Supplementary Table 4). 

Next, we examined the MLL1\gm-binding activity of several MEN1 
disease-derived mutations (His139Asp, Cys241Phe, Ala242Val, 
Gly281Arg, Ala284Gln, and Thr344Arg). Except for Ala284Gln and 
Thr344Arg, which yielded insoluble proteins, the remaining mutants 
impaired the menin-MLL1 4pm interaction (Supplementary Table 4). 
To further examine the menin-MLL] interaction in vivo, we studied 
the interactions of mutant proteins that are transiently expressed in 
human embryonic kidney 293T cells. Consistent with the isothermal 
titration calorimetry analysis, co-immunoprecipitation data showed 
that mutations of the key residues at the interface completely abolished 
the menin-MLLI interaction in cells (Fig. 2d). 

Menin upregulates the expression of homeobox genes Hoxc8 and 
Hoxc6 (ref. 5). To test the effect of the menin—-MLL interaction on the 
expression levels of Hoxc8 and Hoxc6, wild-type and MLL-binding 
deficient mutants of menin were individually used to complement 
menin-null mouse embryonic fibroblasts. Western blot analyses indi- 
cated comparable expression of wild-type and mutant proteins in cells 
(Supplementary Fig. 8a). When MenI ‘~ cells were complemented 
with wild-type menin, expression of Hoxc8 and Hoxc6 dramatically 
increased compared to vector-expressing cells (Fig. 2e and Sup- 
plementary Fig. 8b). In contrast, overexpression of the menin mutants 
in Men1 ‘cells failed to upregulate the messenger RNA levels of Hoxc8 
or Hoxc6 (Fig. 2e and Supplementary Fig. 8b), suggesting that the 
menin-MLL interaction is essential for Hoxc8 and Hoxc6 expression. 

Next we performed chromatin immunoprecipitation (ChIP) assays 
to determine the binding of mutant menin at the Hoxc8 promoter. 
Except for Ala284Gln (a mutant that leads to insoluble proteins), all 
other mutants bound to the Hoxc8 promoter as effectively as wild-type 
menin (Fig. 2f). Expression of wild-type or mutant menin did not 
greatly affect H3 distribution at the Hoxc8 promoter (Supplemen- 
tary Fig. 8c). Notably, Men1 ‘~ cells complemented with wild-type 
menin exhibited a substantial increase in MLL1 binding and histone 
H3K4me3 trimethylation at the Hoxc8 promoter compared with 
vector-expressing or mutant-menin-expressing cells (Fig. 2f). 
Therefore, although menin mutants were able to bind to the Hoxc8 
promoter, their ability to recruit MLL1 and thus establish H3K4me3 at 
the Hoxc8 promoter was compromised, resulting in reduced Hoxc8 
expression. 

LEDGF, a chromatin-associated protein’, is required for MLL1- 
dependent transcription and leukaemic transformation’. Isothermal 
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titration calorimetry measurement showed that a complex composed 
of menin and an N-terminal fragment of MLLI, called MLL1ypm-tam 
(comprising residues 6-153 and including both menin-binding and 
LEDGF-binding motifs) binds to the integrase binding domain of 
LEDGF (LEDGFjgp) with an affinity of 470 nM (Fig. 3a and Sup- 
plementary Fig. 9a). In contrast, neither menin nor MLL1ypm-tpm 
alone could interact with LEDGFigp (Supplementary Fig. 9b)’*. We 
determined the menin-MLL1\ypu-Lam-LEDGFigp complex structure 
at a resolution of 3.0 A (Supplementary Fig. 9c and Supplementary 
Table 1). MLL1y4p-Lpm exhibits an extended conformation and binds 
to menin through two major sites (Fig. 3b); the N-terminal MLL1 gm 
coil folds into the high-affinity pocket of menin in the same manner as 
in the menin-MLL1 yp» structure (Supplementary Figs 9d, e), whereas 
the C-terminal helix «2 packs on the surface of the N-terminal domain 
of menin to form a V-shaped groove for LEDGF gp binding (Fig. 3b and 
Supplementary Fig. 9f). The middle loop of MLL1pm-tpm Spans a large 
distance on menin without many specific interactions except for two 
leucine residues (Leu 106 and Leu 116) with side-chains that point to 
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Figure 3 | Structure of the menin-MLL1ygm.1gm-LEDGF\gp ternary 
complex. a, Domain organization of menin, MLL1 and LEDGE. The menin- and 
LEDGF-binding motifs and LBM motifs of MLL] are shown in yellow, menin in 
cyan, the integrase-binding domain of LEDGF in red and other regions in grey. 
Interactions among the three proteins are shown in orange. b, Ribbon diagram of 
the menin-MLL1\ygm-13m-LEDGFjgp complex. Menin is in cyan, 
MLL1\pm_-1pm in yellow and LEDGFigp in red. c, The extended MLL1 loop 
between MLL1 gm and MLL1, py covers a large part of the surface area of menin. 
d, Detailed view of the intermolecular three-helix-bundle at the ternary interface. 
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two shallow pockets on the menin surface, defining the path of the loop 
(Fig. 3c and Supplementary Fig. 9g). Helix aE of LEDGFigp is 
sandwiched between helices «2 of MLL] and «4 of menin through both 
hydrophobic and electrostatic interactions (Fig. 3d). In support of the 
crystal structure, mutations of residues on the «4 helix of the 
N-terminal domain of menin (Ala95Arg and Serl04Tyr) specifically 
disrupted the interaction with LEDGF (Supplementary Table 5 and 
Supplementary Fig. 10). Notably, Men1~'~ cells that were comple- 
mented with these two mutants failed to stimulate Hoxc8 expression 
(Supplementary Fig. 11), suggesting that a functional menin-MLLI- 
LEDGF complex is required for upregulation of Hoxc8 expression. 
Together, our data show that menin functions as an adaptor molecule 
to modulate gene expression by binding MLL] at one site while also 
interacting with LEDGF at a distinct surface. 

Although MLL1 and MLI2 share many functional motifs, including 
the menin-binding motif (Supplementary Fig. 12), MLL2 does not 
contain a LEDGF-binding motif sequence and thus would not form 
a ternary complex with menin and LEDGF. Given that the PWWP 
domain of LEDGF, which contains a relatively well conserved Pro- 
Trp-Trp-Pro signature, is required for MLL1-mediated leukaemic 
transformation’*'’, the inability of MLL2 to form a menin-MLL2- 
LEDGF complex explains why only MLL1, and not MLL2, has so far 
been described as a proto-oncogene that can be activated by chromo- 
somal translocations. 

Menin also interacts directly with transcription factor JUND*’. We 
defined JUND residues 27-47 as the menin-binding motif 
(JUNDmpm) with an affinity of 1.6 uM (Fig. 4a and Supplementary 
Fig. 13). Sequence comparison of JUNDwpm and MLL1 pm revealed a 
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striking similarity (Fig. 4a), suggesting that JUNDpm might interact 
with menin through the same binding pocket as does MLL1ypm.- 
Consistent with this idea, both isothermal titration calorimetry and 
glutathione S-transferase (GST) pull-down assays showed that MLL1 
could efficiently compete with JUND for menin binding (Supplemen- 
tary Fig. 14). 

We determined the menin-JUNDypm complex structure, which 
shows many similarities to the menin-MLL]ypm structure (Fig. 4b 
and Supplementary Table 1). First, the Phe-Pro-(Ala or Gly)-(Arg or 
Ala)-Pro motifs in both menin-binding motifs are almost identical in 
overall conformation (Supplementary Fig. 15a). Second, Phe32, Pro33 
and Pro36 of JUND interact with menin in the same way as their counter- 
parts in MLL1\ygm (Supplementary Fig. 15b, c). Notably, two lysine 
residues (Lys 46 and Lys 47) in JUNDypyp equivalent to the disordered 
Arg 24 and Arg 25 in MLLI pm, are visible in the electron density map 
and point to an acidic surface on menin (Supplementary Fig. 15c). 
Mutation of these lysine residues and other key binding residues at the 
interface abolished or weakened the interaction both in vitro and in vivo 
(Fig. 4c, Supplementary Fig. 16 and Supplementary Table 6). 

Menin uncouples JUND phosphorylation from JNK activation, but 
the mechanism is poorly understood’’. The consensus JNK-docking 
domain (D-domain) contains a cluster of basic amino acids preceding 
two leucine residues” (Fig. 4d). JUNDyygm is partially overlapped with 
a putative D-domain of JUND (JUNDp)”' (Fig. 4d). Both the basic 
residues and the leucine residues in JUNDp are indispensable for JNK 
docking on JUND as well as JNK-mediated JUND phosphorylation 
(Fig. 4e, f and Supplementary Fig, 17a, b). Thus, Lys 46 and Lys 47 
are both required for menin binding and JUND phosphorylation by 
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Figure 4 | Structural and functional studies of the menin-JUND 
interaction. a, Sequence alignment of the menin-binding motif sequences of 
JUND, MLL1 and MLI2. Conserved residues are highlighted in yellow. 

b, Crystal structure of the menin-JUND,ygm complex. Menin is coloured as in 
Fig. 1c and JUNDmpm is shown as a purple stick model. ¢, Co- 
immunoprecipitation of WT or mutant menin and JUND from 293T cells. 

d, Sequence comparison of the N termini of JUND and c-JUN. The menin- 
binding motif sequence of JUND is highlighted in purple. Key residues in the 
JNK-docking domain are denoted with blue dots and three phosphorylation 


sites are labelled. e, In vitro GST-pull-down analysis of the interactions between 
FLAG-tagged JNK and the indicated JUN proteins. f, In vitro phosphorylation 
of WT or mutant JUND by JNK. g, In vitro phosphorylation of WT or menin- 
binding-deficient JUND by JNK in the presense or absence of menin. h, Menin 
suppresses JUND phosphorylation in response to anisomycin activation of JNK 
in 293T cells. i, WT or mutant JUND plasmids were transfected into 293T cells 
with AP1 and Renilla reporter plasmids, and with or without menin. Luciferase 
assays were performed 2 days after transfection ( = 4; error bar, standard 
deviation). 
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JNK. This led us to test whether menin inhibits JUND phosphorylation 
through sequestering JUND from JNK. In GST-pull-down assay, GST- 
JUND can only pull down JNK in the absence of menin, indicating that 
menin has a higher affinity to JUND (Fig. 4e and Supplementary 
Fig. 17a). Furthermore, when menin was added, phosphorylation of 
JUND was clearly inhibited (Fig. 4g and Supplementary Fig. 17c). In 
contrast, phosphorylation of menin-binding-deficient mutants of 
JUND was not affected by menin (Fig. 4g and Supplementary 
Fig. 17c). Next, we examined whether menin could inhibit JUND 
phosphorylation in response to anisomycin activation of JNK in 
293T cells’. Although wild-type JUND phosphorylation was sup- 
pressed by menin, menin-binding deficient mutants remained robustly 
phosphorylated in the presence of menin (Fig. 4h). Notably, menin had 
no effects on JNK binding and JNK-mediated phosphorylation of 
c-JUN, a close homologue of JUND that lacks a menin-binding motif 
(Fig. 4d, e, g and Supplementary Fig. 17a, c, d). Together, our findings 
reveal that the menin-JUND interaction blocks JNK docking on JUND 
and inhibits the JNK-mediated phosphorylation. 

Menin represses JUND-mediated transcriptional activation*’. To 
examine whether this repression depends on the menin-JUND inter- 
action, wild-type or menin-binding-deficient mutants of JUND were 
co-transfected into 293T cells with an AP1 luciferase reporter plasmid 
in the presence or absence of menin (Supplementary Fig. 17e). 
Consistent with previous studies, transactivation by JUND was effec- 
tively repressed by menin”” (Fig. 4i). In contrast, menin exhibited a 
marginal effect on mutant JUND-mediated transcriptional activation 
(Fig. 4i). We recently demonstrated that JUND induces gastrin gene 
expression in human AGS gastric cells and that this induction can be 
suppressed by menin”’. Consistent with the luciferase assay, menin 
failed to suppress the gastrin upregulation that was induced by mutant 
JUND, suggesting that the menin-JUND interaction is important in 
gastrin expression regulation (Supplementary Fig. 18). Thus, we con- 
clude that the menin-JUND interaction plays a key part in suppressing 
JUND-mediated transcriptional activation. 

In summary, our structural and functional studies provide a mech- 
anistic explanation of how menin could both positively and negatively 
regulate gene transcription. Our findings also provide evidence that 
menin acts as a scaffold protein to assemble a menin-MLL1-LEDGF 
ternary complex to coordinate gene transcription and promote MLL1- 
fusion-protein-induced leukaemogenesis. 


METHODS SUMMARY 


Human menin, LEDGFypp and the MLL and JUND peptides were expressed in 
Escherichia coli BL21(DE3) and purified by sequential affinity and gel-filtration chro- 
matography purification. Menin crystals were obtained in sitting drops over 100 mM 
sodium cacodylate (pH 6.5) and 1.4 M sodium acetate. Crystallization of menin with 
the MLL1ygm or JUNDyypm peptides was achieved by sitting-drop diffusion with a 
well solution containing 100 mM Tris-HCl (pH 7.0), 200 mM MgCl, and 2.3 M NaCl. 
The menin-MLL1\ygm-1gm-LEDGFjgp complex was crystallized by hanging-drop 
vapour diffusion against a well solution of 50 mM HEPES (pH 7.0), 1.6 M (NH4)2SO4, 
10 mM MgCl, 0.016% L-canavanine, 0.016% O-phospho-L-serine, 0.016% taurine, 
0.016% quinine, 0.016% sodium glyoxylate monohydrate and 0.016% cholic acid, and 
were dehydrated with the solution containing 50 mM HEPES (pH 7.0), 2.3 M 
(NH4)2SO, and 10 mM MgCl. The menin-MLL1 ym complex structure was deter- 
mined by multi-wavelength anomalous dispersion to a resolution of 3.0 A. The 
structures of menin alone, the menin-MLL1\y3m-13m-LEDGF {pp ternary complex, 
and the menin-JUNDygm complex were solved by molecular replacement and 
refined to resolutions of 2.5 A, 3.0 A and 2.85 A, respectively. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 


Protein expression and purification. To facilitate crystallization, we genetically 
deleted an unstructured loop (residues 460-519) in menin, a short fragment (residues 
40-45) in JUNDmpm and two loop regions (residues 16-22 and 36-102) in 
MLL1y\gm-1em- All the resulting proteins retain wild-type-like binding affinities 
(Supplementary Figs 2d, 9c and 13d). For simplicity, MeninA, JUNDypmA and 
MLLIygm-tpmA are referred to as menin, JUNDygm, and MLL1ygm-tam 
respectively, unless stated otherwise. 

Various human menin proteins and the MLL and JUND peptides were 
expressed in E. coli BL21(DE3) using a modified pET28b vector with a SUMO 
protein fused at the N terminus after the His, tag. After induction for 16 h with 0.1 
mM isopropylthiogalactoside (IPTG) at 25 °C, the cells were collected by 
centrifugation and the pellets were resuspended in lysis buffer (50 mM Tris- 
HCl, pH 8.0; 50 mM NaH,PO,; 400 mM NaCl; 3 mM imidazole; 10% glycerol; 
0.1 mg ml 1 lysozyme; 2 mM 2-mercaptoethanol; 1 mM PMSF; 5 mM benzamidine; 
1 pg ml" leupeptin; and 1 jg ml’ pepstatin). The cells were then lysed by sonica- 
tion and the cell debris was removed by ultracentrifugation. The supernatant was 
mixed with Ni-NTA agarose beads (Qiagen) and rocked for 2 h at 4 °C before elution 
with 250 mM imidazole. Ulp1 protease was then added to remove the Hiss-SUMO 
tag. After Ulp1 digestion, the menin proteins and the MLL and JUND peptides were 
further purified by gel-filtration chromatography on Hiload Superdex 200 and 
Hiload Superdex 75 columns (GE Healthcare), equilibrated with buffer A (25 mM 
Tris-HCl, pH 8.0; 150 mM NaCl; and 5 mM dithiothreitol (DTT)) and buffer B 
(100 mM ammonium bicarbonate), respectively. The purified menin proteins were 
concentrated to 25 mg ml ' and stored at —80 °C. The purified peptides were 
lyophilized and resuspended in water at a concentration of 50 mg ml ' and stored 
at —80 °C. 

For the menin-MLL1ypm-tpm-LEDGFjpp complex, we cloned LEDGF {gp into 
a modified pET28b vector with a SUMO protein fused at the N terminus after the 
Hisg tag. MLL1pm-1pm was cloned into a GST fusion protein expression vector, 
pGEX6p-1 (GE healthcare). The menin-MLL1\y;3-1nm complex and LEDGFigp 
itself were expressed in E. coli BL21(DE3), respectively. The menin- 
MLL1ysgm-Lpm Complex was purified by sequential affinity chromatography with 
Ni-NTA agarose beads and glutathione sepharose 4B beads (GE Healthcare). After 
removal of the Hiss~UMO tag and GST tag with Ulp1 and Protease 3C, respec- 
tively, the complex was purified further with gel-filtration chromatography on a 
Hiload Superdex 200. Meanwhile, LEDGFigp was purified in the same way as 
menin and then mixed with the purified menin-MLL1\gm-tpm complex with a 
molar ratio of 2:1. After 1 h incubation on ice, the protein mixtures were purified 
again with gel-filtration chromatography on a Hiload Superdex 200 column. 

For the in vitro assays, mutant menin proteins were expressed in E. coli and 

purified following the procedure described above. All the mutant menin proteins 
displayed unaltered biophysical properties as analysed by gel-filtration chromato- 
graphy (data not shown), ensuring that the altered affinities of the menin mutants 
for MLL1 pm. MLL yygu-tpm—-LEDGF pp and JUNDyypm are not attributable toa 
change in the structural integrity of the resulting proteins. 
Crystallization, data collection and structure determination. Menin was 
crystallized by sitting-drop vapour diffusion at 4 °C. The precipitant solution 
contained 100 mM sodium cacodylate trihydrate (pH 6.5) and 1.4 M sodium 
acetate trihydrate. For the menin-MLL1yypm complex, purified menin was first 
mixed with the MLL1,ygy peptide at a molar ratio of 1:2 and then the mixture was 
incubated on ice for 1 h to allow complex formation. Crystallization of the complex 
was achieved by sitting-drop vapour diffusion at 4 °C with the well solution 
containing 100 mM Tris-HCl (pH 7.0), 200 mM MgCl, and 2.3 M NaCl. A similar 
procedure was also used for crystallization of the menin-JUNDygm complex. The 
menin-MLL1\ypm-tpm-LEDGFipp complex was crystallized by hanging-drop 
vapour diffusion at 4 °C with the well solution containing 50 mM HEPES (pH 
7.0), 1.6 M (NH4)2SO4, 10 mM MgCl, 0.016% L-canavanine, 0.016% O-phospho- 
L-serine, 0.016% taurine, 0.016% quinine, 0.016% sodium glyoxylate monohydrate 
and 0.016% cholic acid. The crystals were then dehydrated with the solution 
containing 0.05 M HEPES (pH 7.0), 2.3 M (NH4)2SO, and 0.01 M MgCh. 

All of the crystals were gradually transferred into a harvesting solution contain- 
ing the respective precipitant solutions plus 5 M sodium formate, before being 
flash-frozen in liquid nitrogen for storage. Data were collected under cryogenic 
conditions (100 K). Selenomethionine-multi-wavelength anomalous dispersion 
data set of the menin-MLL1\;p., complex at the Se peak and inflection wave- 
lengths were collected at the Advanced Photon Source (APS) beamline 21-ID-D 
and processed using HKL2000 (ref. 23). Seven selenium atoms were located and 
refined, and the multiwavelength anomalous diffraction data phases were calcu- 
lated using SHARP™. The initial multi-wavelength anomalous dispersion map of 
the menin—MLL1yypm complex was substantially improved by solvent flattening. 
A model was manually built into the modified experimental electron density using 
O (ref. 25) and further refined in Phenix*’. Native data sets of menin and the menin 
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complexes were collected at the APS beamline 21-ID-D and processed using 
HKL2000. The structures were determined by molecular replacement using 
Phaser in the CCP4i suite”’ and further refined in Phenix. The majority (~95%) 
of the residues in all structures lie in the most favoured region in the 
Ramachandran plot, and the remaining structures lie in the additionally stereo- 
chemically allowed regions in the Ramachandran plot. 

Isothermal titration calorimetry. The equilibrium dissociation constants of the 
menin-MLLygm, menin-JUNDygm and menin-MLL1\gm-tam-LEDGFjgp 
interactions were determined using a VP-ITC calorimeter (MicroCal). The bind- 
ing enthalpies were measured at 20 °C in 25 mM Tris-HCl (pH 8.0) and 150 mM 
NaCl. Two independent experiments were performed for every interaction 
described here. Isothermal titration calorimetry data were subsequently analysed 
and fitted using Origin 7 software (OriginLab) with blank injections of peptides 
into buffer subtracted from the experimental titrations before data analysis. 
Yeast two-hybrid assay. The yeast two-hybrid assays were performed using the 
yeast L40 strain harbouring pBTM116 and pACT2 (Clontech) fusion plasmids. 
The colonies containing both plasmids were selected on -Leu -Trp plates. The 
activities of -galactosidase were measured according to Clontech 
MATCHMAKER library protocol and the averages from three individual trans- 
formants were reported. 

Plasmid construction. To generate recombinant retroviruses, pMX-2 FLAG- 
menin was constructed by inserting polymerase chain reaction (PCR)-amplified 
menin cDNA into the BamHI/NotlI site of the retroviral vector pMX-2 FLAG. 
To generate menin mutants, pMX-2 FLAG -menin was used as a template for 
site-directed mutagenesis using the QuikChange kit from Agilent. 

Cell culture and transfection. Menin-null MEFs, HEK293T and the human AGS 
gastric adenocarcinoma cell line were cultured in Dulbecco’s modified Eagle’s 
medium complemented with 10% fetal calf serum and 1% PenStrep. Menin-null 
MEFs were infected with empty vector, wild-type or mutant menin-expressing 
retroviruses and were subjected to puromycin selection (2 jig ml‘) 72 h post- 
infection for 2 days. AGS and 293T cells were transiently transfected with the 
indicated expression vectors using Lipofectamine 2000 (Invitrogen) for 48 h. 
Co-immunoprecipitation. Human 293T cells were transfected with pcDNA3.1 
vectors encoding c-MYC-tagged MLL1 (residues 1-153) and FLAG-tagged 
menin. Two days after transfection the cells were resuspended in 1 ml of lysis 
buffer (20 mM Tris-HCl, pH 7.5; 150 mM NaCl; 1.0% Triton X-100; 1 mM EDTA; 
and protease inhibitor cocktail). Immunoprecipitation of lysates was conducted 
using 20 ul anti-FLAG M2 affinity agarose (Sigma). After washing with lysis 
buffer, immunoprecipitated proteins were eluted with <2 loading buffer (50 
mM Tris-HCl, pH 6.8; 2% SDS; 10% 2-mercaptoethanol; 10% glycerol; and 
0.002% bromophenol blue), subjected to protein gel-electrophoresis using 
4-20% SDS-polyacrylamide gel electrophoresis (SDS-PAGE) and then trans- 
ferred to a polyvinylidene fluoride (PVDF) membrane. After blocking with 
TBST buffer containing 5% skimmed milk, proteins on the membrane were 
detected by western blot using anti-FLAG (Sigma) and anti-c-MYC (Santa Cruz 
Biotechnology) antibodies. The same procedure was also used for the co-immu- 
noprecipitation experiments for menin and JUND. 

Quantitative real-time PCR analysis. Exponentially growing MEFs were seeded 
at 2X 10° cells per 100-mm dish and harvested 2 days later. AGS cells were 
transfected with the menin and JUND expression vectors for 48 h. Total RNA 
was isolated with an RNeasy minikit from Qiagen. Quantitative real-time PCR 
(qRT-PCR) was performed in an ABI 7500 Real Time PCR system (Applied 
Biosystems). 

ChIP assay. MEFs were cross-linked with 1% formaldehyde for 10 min at 37 °C. 
Cross-linking was stopped by addition of 125 mM glycine. The ChIP assay was 
performed using the QuikCHIP kit from Imgenex, according to the manufac- 
turer’s instructions. Antibodies used for ChIP were anti-menin (Bethyl labs), 
anti-MLLI, anti-histone H3K4me3, anti-histone H3 and IgG (Abcam). 
Antibody-precipitated DNA-protein complex was reverse cross-linked, and the 
DNA was isolated using phenol-chloroform extraction and the precipitated DNA 
was used as the template for PCR. 

GST-pull-down assay. GST, GST-fused c-JUN (residues 1-246), GST-fused 
JUND (residues 1-150) and FLAG-tagged JNK3 were expressed in E. coli 
BL21(DE3) and were purified to homogeneity. GST-pull-down assays were per- 
formed by incubating 10 jig of GST or GST-JUN, 10 ug of FLAG-JNK3 with 10 pl 
of glutathione sepharose 4B beads and either with or without 20 1g of full-length 
menin in binding buffer (50 mM Tris-HCl (pH 8.0) and 150 mM NaC]) at 4 °C 
overnight. The beads were then extensively washed with binding buffer four times 
and the bound proteins were eluted with 10 mM reduced glutathione in binding 
buffer. After separation on 15% SDS-PAGE and Ponceau S staining, FLAG-tagged 
JNK3 protein was detected by western blot using anti-FLAG antibody. 

In vitro kinase assay. WT c-JUN (residues 1-246) and WT or mutant JUND 
proteins (residues 1-150) were expressed in E. coli BL21(DE3) and purified as 


©2012 Macmillan Publishers Limited. All rights reserved 


LETTER 


described above for the purification of menin. For the in vitro kinase assay, 2 1g 
substrate was mixed with 0.5 jig kinase and 50 uM ATP, either with or without 10 
ug of full-length menin protein in the kinase buffer (50 mM Tris-HCl, pH 7.5; 20 
mM MgCl; 20 mM f-glycerophosphate; 2 mM DTT; and 0.1 mM sodium ortho- 
vanadate), and incubated at 30 °C for 1 h. The reaction mixtures were then 
separated on 15% SDS-PAGE, visualized with Ponceau S staining and the phos- 
phorylated JUN proteins were detected with anti-JUND phosphor-Ser100 (anti-c- 
JUN phosphor-Ser 73) antibody (Cell Signaling). 

In vivo kinase assay. 293T cells were transfected with expression vectors encoding 
FLAG-tagged menin and c-MYC tagged JUND. After 48 h of transfection, cells 
were incubated for 30 min with or without 10 tg ml~’ anisomycin (Sigma), a 
potent JNK activator, and then the cell lysates were subjected to western blot with 
anti-JUND phosphor-Ser100 (anti-c-JUN phosphor-Ser 73, Cell Signaling), anti- 
FLAG and anti-c-MYC antibodies. 

Luciferase assay. 293T cells were transfected with 1 tg of AP1 luciferase reporter 
plasmid (Stratagene), which contains seven copies of AP1-binding consensus 
12-O-tetradecanoylphorbol 13-acetate-response element (TRE) upstream of the 
luciferase reporter gene), 0.25 1g of Renilla reporter plasmid and 0.5 ug of WT or 
mutant JUND plasmids, either without or with 0.5 pg of menin cDNA. Luciferase 
assays were performed using the dual luciferase assay kit (Promega) 2 days after 
transfection. To determine the protein expression in each transfection, 20 jug of cell 
lysates were immunoblotted with anti-menin (Bethyl Laboratories) and anti- 
JUND (Santa Cruz Biotechnology) antibodies. 


Cell fractionation. 10° 293T cells were collected and washed in cold PBS and 
hypotonic buffer (10 mM Tris-HCl, pH 7.3; 10 mM KC]; 1.5 mM MgCl; 0.2 mM 
PMSF; and 10 mM B-mercaptoethanol). The cells were then allowed to swell for 15 
min in hypotonic buffer. The swelled cells were then homogenized with glass 
Dounce homogenizer (Wheaton) using the loose pestle until cell membrane lysis 
was 80-90%. The nuclei were collected by centrifuging for 15 min at 3,300g, 
resuspended in high salt buffer (600 mM KCI; 20 mM Tris pH 7.4; 25% glycerol; 
1.5 mM MgCl; and 0.2 mM EDTA) and homogenized to break the nuclear 
membrane. The nuclear extracts were collected by centrifugation at 25,000g for 
30 min and were then fractionated on a Superose 6 gel-filtration column (GE 
Healthcare). The resulting fractions were resolved by 10% SDS-PAGE and probed 
with anti-menin, anti-MLL1 and anti-JUND antibodies. 
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Structure of the human M2 muscarinic acetylcholine 
receptor bound to an antagonist 
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The parasympathetic branch of the autonomic nervous system 
regulates the activity of multiple organ systems. Muscarinic receptors 
are G-protein-coupled receptors that mediate the response to 
acetylcholine released from parasympathetic nerves'*. Their role 
in the unconscious regulation of organ and central nervous system 
function makes them potential therapeutic targets for a broad 
spectrum of diseases. The M2 muscarinic acetylcholine receptor 
(M2 receptor) is essential for the physiological control of cardio- 
vascular function through activation of G-protein-coupled inwardly 
rectifying potassium channels, and is of particular interest because of 
its extensive pharmacological characterization with both orthosteric 
and allosteric ligands. Here we report the structure of the antagonist- 
bound human M72 receptor, the first human acetylcholine receptor to 
be characterized structurally, to our knowledge. The antagonist 
3-quinuclidinyl-benzilate binds in the middle of a long aqueous 
channel extending approximately two-thirds through the mem- 
brane. The orthosteric binding pocket is formed by amino acids that 
are identical in all five muscarinic receptor subtypes, and shares 
structural homology with other functionally unrelated acetylcholine 
binding proteins from different species. A layer of tyrosine residues 
forms an aromatic cap restricting dissociation of the bound ligand. 
A binding site for allosteric ligands has been mapped to residues 
at the entrance to the binding pocket near this aromatic cap. The 
structure of the M2 receptor provides insights into the challenges of 
developing subtype-selective ligands for muscarinic receptors and 
their propensity for allosteric regulation. 

Muscarinic receptors constitute a family with five subtypes, M1-M5 
(ref. 1). M1, M3 and M5 subtypes couple with the G, family of G 
proteins, and M2 and M4 subtypes with the G,/G, family of G proteins. 
Previous work showing that the muscarinic action by a series of choline 
esters and other substances in various tissues could be differentiated 
from their nicotinic action’ led to muscarinic acetylcholine receptors 
being defined as a functional concept. Muscarinic receptors are now 
known to be G-protein-coupled receptors (GPCRs)? and the nicotinic 
receptor a ligand-gated ion channel. Muscarinic receptors were 
initially defined biochemically as proteins that specifically bound 
3-quinuclidinyl-benzilate (QNB) and N-methylscopolamine (NMS). 
They were among the first GPCRs to be purified from cerebral mem- 
branes’, and to be functionally reconstituted with purified G protein 
in lipid vesicles’. The M1 receptor* together with the B, adrenergic 
receptor® were the first neurotransmitter-activated GPCRs to be 
cloned, revealing the seven transmembrane (TM) segment topology 
initially observed for rhodopsin’, and subsequently found to be 
common to all members of the GPCR family. 

As a consequence of their roles in both the central and para- 
sympathetic nervous systems, muscarinic receptors are targets for 
the treatment ofa spectrum of disorders including Alzheimer’s disease, 


schizophrenia and Parkinson’s disease, and chronic obstructive 
pulmonary disease*. However, developing highly subtype-selective 
orthosteric drugs for muscarinic receptors has been challenging and 
thus far largely unsuccessful. Recent drug discovery efforts have 
therefore shifted to the development of small molecule allosteric 
modulators. Muscarinic receptors have long been a model system 
for studying allosteric regulation of GPCR signalling because of their 
exceptional propensity to bind allosteric ligands’. To understand 
better the structural basis for challenges in developing orthosteric 
drugs and the susceptibility for allosteric regulation, we obtained a 
crystal structure of the M2 receptor. 

In our initial efforts to obtain the structure of the M2 receptor 
we expressed and purified M2 receptor lacking most of the third 
intracellular loop (IL3) and the native glycosylation sites. The central 
part of IL3 of the M2 receptor can be removed without impairing its 
ability to bind to agonists or activate G proteins'®, and IL3 was shown 
to have a flexible structure’. Using this modified M2 receptor bound 
to the high-affinity inverse agonist R-(—)-3-QNB, we performed 
crystallization by hanging-drop vapour diffusion and obtained crystals 
that diffracted to around 9 A, but were not able to improve the quality 
of these crystals. We subsequently replaced IL3 of the M2 receptor 
with T4 lysozyme (T4L) as initially described for the B, adrenergic 
receptor’” (Supplementary Fig. 1a). This method has been used to obtain 
crystal structures of four other GPCRs: the adenosine Az, receptor’’, the 
CXCR4 receptor", the dopamine receptor D3 (ref. 15) and most recently 
the histamine H, receptor'®. The binding properties of M2-T4L with 
muscarinic ligands were essentially the same as for the wild-type M2 
receptor (Supplementary Fig. 1b, c), indicating that the overall TM 
architecture of M2-T4L was minimally affected by introduction of 
TAL. The M2-T4L receptor was subsequently crystallized in lipidic cubic 
phase. A 3.0 A structure was solved by molecular replacement from a 
data set obtained by merging diffraction data from 23 crystals. 

As is typical for proteins crystallized by the lipidic cubic phase 
method, the lattice for the M2 receptor shows alternating aqueous 
and lipidic layers with M2 receptor molecules embedded in the latter 
while T4L is confined to aqueous regions (Supplementary Fig. 2). 
Within the membrane plane, receptor molecules are packed closely 
against one another, alternating orientations within the bilayer. There 
are abundant hydrophobic contacts between receptor molecules 
within the membrane, whereas polar interactions primarily involve 
contacts between T4L molecules as well as receptor-T4L interactions. 

The overall structure of the M2 receptor (Fig. 1a) is similar to that of 
rhodopsin and other recently crystallized inactive GPCR structures 
(compared in Supplementary Fig. 3). The cytoplasmic surface of the 
M2 receptor is in an inactive conformation, but as with most other 
GPCR structures, there is no interaction involving Arg 121°°° (super- 
scripts indicate Ballesteros-Weinstein numbers) in the conserved 
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Figure 1 | The M2 receptor with bound QNB. a-e, The M2 receptor is shown 
as a blue ribbon and QNB as orange spheres. a, M2 receptor in profile. 

b, Cytoplasmic surface showing conserved DRY residues in TM3. 

c, Extracellular view into QNB binding pocket. d, Extracellular view with 
solvent-accessible-surface rendering shows a funnel-shaped vestibule and a 


E/DRY sequence in TM3 and Glu 382°°° in TM6 (Fig. 1b). Instead, the 
Arg121°*° side chain forms a salt bridge only with Asp 120°”. In 
rhodopsin, the homologous residues form part of a charge-charge 
interaction that stabilizes the cytoplasmic ends of TM3 and TM6 in an 
inactive state'’. The second intracellular loop shows a helical conforma- 
tion similar to that first seen for the turkey B, adrenergic receptor”. 
GPCR crystal structures show the greatest differences in the extra- 
cellular surface (Supplementary Fig. 3). The M2 receptor has a relatively 
simple and open extracellular surface (Fig. 1c, d) with the longer extra- 
cellular loop (ECL)2 stabilized by a conserved disulphide with Cys 96°” 
at the N terminus of TM3 and Cys176 in the middle of ECL2. In 
addition, the second disulphide bond was detected between C413 and 
C416 in the ECL3. The extracellular surface of the M2 receptor most 
resembles that of the dopamine D3 receptor (Supplementary Fig. 3). 
Crystal structures of GPCRs reveal a network of hydrogen bonding 
interactions that extend from the binding pocket to the cytoplasmic 
surface. However, a distinctive feature of the M2 receptor is that this 
network is part of a long, continuous aqueous channel extending from 
the extracellular surface to a depth of approximately 33 A when 
measured from ECL2 (Fig. le). This channel contains the ligand bind- 
ing pocket, but extends beyond the ligand and is separated from the 
cytoplasmic surface by a hydrophobic layer formed by three amino 
acids: Leu65~*° in TM2, Leu 114" in TM4 and Ile 392°° in TM6. 
Each of these is absolutely conserved among all five muscarinic sub- 
types. The dimensions of the channel below the QNB binding site are 
large enough to accommodate a long, extended orthosteric ligand. 
Supplementary Fig. 4 compares the aqueous channels of other GPCRs. 
The ligand QNB binds within a deeply buried pocket defined by the 
side chains of TM3, 4, 5, 6 and 7 (Fig. 2a—c and Supplementary Fig. 5 
and Supplementary Table 3). An aromatic cage encloses the amine and 
forms a lid over the ligand, separating the orthosteric site from the 
extracellular vestibule. Asp 103°*” and Asn 404°°” serve to orient the 
ligand in the largely hydrophobic binding cavity, with Asn 404°°? 
forming paired hydrogen bonds with the hydroxyl and carbonyl groups 
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nearly buried QNB binding pocket. e, Aqueous channel (green) extending from 
the extracellular surface into the transmembrane core is interrupted by a layer 
of three hydrophobic residues (blue spheres). Well-ordered water molecules are 
shown as red dots. 


in QNB while Asp 103° engages in a charge—charge interaction with 
the amine moiety of the ligand (Fig. 2). The TM amino acids that form 
the QNB binding pocket are identical in all five muscarinic receptor 
subtypes (Supplementary Table 1), consistent with results of QNB 
binding experiments on M1-M4 receptors, and with site-directed 
mutagenesis experiments on M1 (ref. 19), M2 (ref. 20) and M3 
(ref. 21) receptors. Only Phe 181, which extends downward from 
ECL2 and interacts with one of the two phenyl rings on QNB 
(Fig. 2), differs from all other muscarinic receptor subtypes, which have 
leucine in the homologous position. The importance of Asp**” for both 
agonist and antagonist binding has been demonstrated in mutagenesis 
and covalent-labelling experiments and modelling studies’? ”*. In con- 
trast, mutation of Asn 404°” to Ala on M1 (ref. 23) and M3 (ref. 24) 
receptors was shown to greatly affect binding of QNB but have little 
effect on binding of or activation by acetylcholine. It is possible that 
Asn 404°°? is hydrogen bonded with the ester group of QNB but not 
that of acetylcholine. 

The M2 and other muscarinic receptors represent one of four families 
of acetylcholine binding proteins to be structurally characterized thus 
far. Figure 3a shows the orthosteric binding site of the M2 receptor with 
acetylcholine docked with the gauche form of the O-C2-C1-N dihedral 
angle, which places the choline group in the aromatic cage interacting 
with Asp 103°*”, while the carbonyl oxygen is tentatively bound to 
Asn 404°*? (Fig, 3a). The natural agonist acetylcholine is much smaller 
than the bulky antagonist QNB. As described in the agonist-bound 
structure of the B, adrenergic receptor, the contraction of the ligand 
binding pocket is expected as a result of an inward shift of TM5 (ref. 25). 
This result is consistent with the previous mutation studies showing that 
Thr 187°? and Thr 190°? in TM5 (Fig. 2) alter binding of most 
agonists but not of antagonists””. Bulky compounds capable of blocking 
activation-related contraction of the pocket would be very efficient in 
locking the M2 receptor in an inactive conformation, as is exemplified 
here by the antagonist QNB. It has been proposed that the con- 
formational change of the M2 receptor upon activation might be 
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Figure 2 | Binding interactions between the M2 receptor and QNB. 

a, b, Two views of the QNB binding pocket. Amino acids within 4 A of the 
ligand are shown as light blue sticks, with QNB in orange. Nitrogen and oxygen 
atoms are coloured dark blue and red, respectively. Polar interactions are 
indicated by dashed lines. A 2F, — F. map is shown in wire at 1.50 contour. c, A 
schematic representation of QNB binding interactions is shown. Mutations of 


accompanied by a conformational change of acetylcholine from the 
gauche to the trans form of the O-C2-C1-N dihedral angle*®. It 
remains to be determined in which pose acetylcholine binds to the 
M2 receptor or to the M2-receptor—G-protein complex, and whether 
acetylcholine hydrogen bonds with Asn 404° or other residues. 

Ina striking example of convergent evolution, the orthosteric site of 


the M2 receptor exhibits many features noted previously as common 
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Figure 3 | Convergent evolution of acetylcholine binding sites. 

a, Acetylcholine is modelled into the crystal structure of the M2 receptor. 
b, Acetylcholine binding pocket in the crystal structure of the acetylcholine 
binding protein from the snail Aplysia californica (PDB accession 2XZ5). 
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amino acids in the red boxes have been shown to reduce both antagonist and 
agonist binding by more than tenfold. Mutations of the amino acid in the 
purple boxes reduce antagonist binding affinity by more than tenfold. 
Mutations of amino acids in the blue boxes reduce agonist binding by more 
than tenfold. Blue dotted lines indicate potential hydrophobic interactions and 
red lines indicate potential polar interactions. 


structural elements in unrelated acetylcholine binding proteins”’. Like 
the M2 receptor, a nicotinic acetylcholine receptor homologue bound 
to acetylcholine (Fig. 3b) shows an aromatic cage comprised of three 
tyrosines and a tryptophan, although it notably lacks a counterion to 
the choline group”, whereas in the M2 receptor this role is filled by 
Asp 103°°*. A bacterial acetylcholine binding protein, ChoX, from 
Sinorhizobium meliloti (Fig. 3c) also possesses an aromatic cage, and 


b = Acetylcholine binding protein (snail) 


Acetylcholine 


Te 


ds Acetylcholine esterase (electric ray) 


a0 


Thioacetylcholine 


c, Acetylcholine binding pocket in the acetylcholine binding protein ChoX 
from the Gram negative bacterium Sinorhizobium meliloti (PDB accession 
2RIN). d, Binding site for thio-acetylcholine in the enzyme acetylcholine 
esterase from the electric ray Torpedo californica (PDB accession 2C4H). 
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Extracellular 


Figure 4 | Allosteric binding in the M2 receptor. a, Differences between the 
M2 and M4 receptors are shown as green residues mapped onto the inner 
surface of the M2 receptor (blue), with QNB in orange spheres. The sequence 
conservation within the orthosteric site is apparent, while residues outside show 
more variability. b-d, Mutations that alter allosteric binding are shown with 
yellow carbons, and amino acids involved in QNB binding are shown with blue 


like the M2 receptor has an aspartate in close proximity to the amine 
engaging in a charge-charge interaction”. Also like the M2 receptor, 
ChoX has an asparagine hydrogen bonding to the ligand carbonyl. Like 
these proteins, the enzyme acetylcholine esterase (Fig. 3d) uses an 
aromatic cage and a carboxylate to bind the choline group, while the 
(thio)acetyl group interacts with a phenylalanine, probably through 
m-m interactions®®. Taken together, these structures suggest that an 
aromatic cage and buried carboxylate are likely to be critical elements 
for acetylcholine recognition and binding in general. 

There is a growing interest in the development of allosteric ligands for 
GPCR targets. This is motivated by the ability to develop more subtype- 
selective drugs targeted at less conserved regions of the receptor. 
Moreover, allosteric ligands modulate the effects of natural hormones 
and neurotransmitters, and may therefore regulate receptor activity in 
a more physiological manner. As noted above, the orthosteric binding 
pocket is highly conserved among all muscarinic receptor subtypes. 
Allosteric regulation of GPCRs was first observed for the M2 receptor 
and this receptor has been one of the most extensively characterized 
allosteric model systems’. Figure 4a shows the inner surface of the M2 
receptor, highlighting residues that are not conserved with its closest 
relative, the M4 receptor. It can be seen that the orthosteric binding 
pocket and transmembrane core are highly conserved. The greatest 
diversity is observed in the extracellular loops and the extracellular end 
of TM segments that form the entrance to the orthosteric binding 
pocket. These amino acids represent structural diversity that could 
be exploited for the development of more subtype-selective ligands’. 
Of interest, site-directed mutagenesis and chimaeric receptor studies 
have implicated several of these amino acids in the binding of several 
well-characterized allosteric modulators’. As shown in Fig. 4b-d, these 
residues are located in ECL2 and the amino-terminus of TM7 at the 
entrance to the binding pocket. Trp 422”*», a residue implicated in the 
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carbons as sticks or spheres. b, c, Different views of possible allosteric binding 
sites in the M2 receptor. The surface view in c shows the positions of possible 
allosteric binding sites (yellow) lining the path to the QNB binding pocket. 

d, Trp 422 (yellow spheres), implicated in binding of allosteric ligands, forms an 
edge-to-face aromatic interaction with Tyr 403, part of the aromatic cage (blue 
spheres) of the orthosteric site. 


binding of several allosteric modulators, appears to form an edge-to- 
face n—T interaction with Tyr 403°", part of the aromatic cage sur- 
rounding the charged amine of the orthosteric ligand (Fig. 4d). Binding 
of allosteric ligands to this site would be expected to influence the 
association and disassociation rates of orthosteric ligands. 

The structure of the M2 receptor provides insights into both orthosteric 
and allosteric regulation of muscarinic receptors. The development of 
more selective drugs for muscarinic receptors will probably require 
exploitation of the more diverse allosteric surface, either as exclusively 
allosteric ligands or as ligands that occupy both orthosteric and 
allosteric sites. 


METHODS SUMMARY 


Untagged human M2 muscarinic acetylcholine receptor was expressed in Sf9 cells 
with the IL3 replaced with T4 lysozyme, then extracted with digitonin and sodium 
cholate and purified by ligand affinity chromatography, then exchanged into decyl 
maltoside buffer. Purified receptor was crystallized by the lipidic cubic phase 
technique following addition of a stabilizing neopentyl glycol detergent. Data 
collection was performed at Advanced Photon Source beamlines 23ID-B and 
23ID-D, and the structure solved by molecular replacement. Refinement statistics 
are given in Supplementary Table 2. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 


Construction of M2-TAL expression vectors for Sf9 cells. The coding sequence 
of the human M2-T4L receptor fusion protein was designed to have N-linked 
glycosylation sites (Asn 2, Asn 3, Asn6 and Asn9) mutated to aspartic acid and 
cysteine-less T4L (C54T, C97A) residues 2-161 inserted into the IL3, replacing 
M2 residues 218-376. This construct was synthesized (TAKARA Bio), and cloned 
into the pFastbacl Sf9 expression vector (Invitrogen) as illustrated in Supplementary 
Fig. la. A TAA stop codon was placed after the R466 codon, terminating translation. 
The synthesized M2-T4L receptor described above was confirmed by sequencing. 
Expression and membrane preparation. Recombinant baculovirus was made 
from pFastbacl-M2-T4L using the Bac-to-Bac system (Invitrogen)*". The M2- 
TAL protein was expressed in baculovirus-infected Sf9 insect cells as described 
previously”. Sf9 insect cells were prepared at a density of 1.0 X 10° cells ml ' and 
suspended in 5 litres of the IPL-41/SF900 II complex media or ESF921 insect 
media. Media containing Sf9 insect cells were transferred into the CELLBAG 22 
L/O (GE Healthcare) and cultured for 4 days with the following culture conditions: 
20 r.p.m., 8.5° rocking angle, 30% Oo, 0.251 min | ofair flow rate and 27 °C. After 
4 days, 200 to 300 ml of the M2-T4L baculovirus stock (approximate multiplicity 
of infection (m.o.i.), 2) and 700 to 800 ml of IPL-41/SF900 II complex media were 
transferred into the CELLBAG (final culture volume, 6 litres) and infected for 2 
days under the following infection conditions: 22 r.p.m., 8.5° rocking angle, 50% 
O,, air flow rate, 0.25lmin ' and 27°C. Two days later, a fraction of the cells was 
harvested for the binding assay and the remaining cells were centrifuged at 6,000g¢ 
for 10 min and harvested. The cell pellet was washed with 250 ml of PBS without 
calcium chloride and magnesium chloride (PBS(—)) and resuspended with 100 ml 
of PBS(—) containing a protease inhibitor cocktail tablet (Roche). Final concen- 
tration of protease inhibitors was 2.5 1g ml’ pepstatin, 2 4g ml ’ PMSF, 20 pg 
ml ' leupeptin and 0.5 mM benzamidine. Cells were quick frozen in liquid nitro- 
gen and stored at —80 °C. 

The membrane was prepared from the M2-T4L-expressing Sf9 insect cells as 

described previously*’. For the preparation of membranes from insect cells, Sf9 
insect cells were centrifuged at 1,500g for 10 min at 4 °C. The pellet was washed 
with PBS(—), then resuspended in 100 ml of hypotonic buffer containing 10 mM 
HEPES at pH 7.5, 20mM KCl, 10mM MgCl and protease inhibitor cocktail, 
followed by Dounce homogenization to resuspend the membranes. Insect cell 
membranes were centrifuged at 100,000g for 30 min and the pellets were resus- 
pended in 10 mM HEPES at pH 7.5, 10 mM MgCl, 20 mM KCl, 40% glycerol, and 
snap-frozen in liquid nitrogen and then stored at —80°C until use. Membrane 
proteins were quantified using the bicinchoninic acid (BCA) method (Pierce) 
using a BSA standard. 
Purification of M2-T4L-QNB. M2-T4L was expressed in Sf9 cells, solubilized 
with digitonin/Na-cholate solution, and purified by using an affinity column with 
aminobenztropine (ABT) as a ligand*’, as described below. The whole procedure 
was carried out at 4 °C. Sf9 membrane preparations with 2.1 kg of wet weight and 
approximately 1.5 jumol of [*H]QNB binding sites were solubilized with 1% digi- 
tonin, 0.35% Na-cholate, 10mM K-phosphate buffer (pH 7.0) (KPB), 50 mM 
NaCl, 1mM EDTA, a cocktail of protease inhibitors (41). The supernatant was 
applied to two ABT columns run in parallel (500 ml each), followed by washing 
with 0.1% digitonin, 0.1% Na-cholate, 20 mM KPB, 150mM NaC] (21 X 2) ata 
rate of approximately 90 mlh'. M2-T4L was eluted from the ABT columns with 
0.5 mM atropine, 0.1% digitonin, 0.1% Na-cholate, 20 mM KPB, 150 mM NaCl in 
21 elution volume for each column, and was bound to a column of hydroxyapatite 
(30 ml), which was washed at a rate of 30-50 mlh ! with a series of solutions as 
follows: (1) 0.1% digitonin, 0.1% Na-cholate, 20 mM KPB (100 ml); (2) 5 uM QNB, 
0.1% digitonin, 0.1% Na-cholate, 20mM KPB (600 ml); (3) 0.35% Na-cholate, 
20mM KPB (600 ml); (4) 0.2% decylmaltoside, 20 mM KPB (500 ml); (5) 0.2% 
decylmaltoside, 150 mM KPB (100 ml); (6) 0.2% decylmaltoside, 500 mM KPB 
(60 ml). M2-T4L-QNB was finally eluted with 0.2% decylmaltoside, 1M KPB 
(50 ml). The eluate was concentrated to approximately 1ml (approximately 
30 mg protein per ml) with Amicon Ultra (MILLIPORE), followed by dialysis 
against 0.2% decylmaltoside, 20mM Tris-HCl buffer (pH 7.5) and storage in 
—80 °C. The yield was estimated to be approximately 50% on the assumption that 
the recovered protein is pure M2-T4L. Protein concentration was determined 
using BCA Protein Assay (PIERCE). Because we purified M2-T4L as a complex 
with QNB we could not estimate the [7H]QNB binding activity because the dis- 
sociation rate of QNB is too slow. However, in preliminary experiments using 
[H]QNB or dissociable atropine as eluants, we confirmed that the receptor is 
purified to near homogeneity. The purity of M2-T4L was confirmed by SDS- 
PAGE and gel permeation chromatography (Supplementary Fig. 6). All the 
QNB used in purification and crystallization was the high-affinity enantiomer, 
R-(—)-3-QNB. 


Measurement of ligand binding activity. Ligand binding activity of wild-type 
M2 and M2-T4L receptors was determined as described previously™. Briefly, the 
receptors solubilized from Sf9 membranes were incubated with 0.1-4nM 
(H]QNB with or without 1 uM atropine, or with 2nM PH]QNB with various 
concentrations of carbamylcholine or atropine in 0.1% digitonin, 20 mM KPB for 
60 min at 30 °C (total volume 0.2 ml). The amount of [>H]QNB bound to receptors 
was assayed by using a small column of Sephadex G50 fine (2 ml). The density of 
[PH] QNB binding sites in the particulate fraction of M2-T4L was 17 pmol per mg 
of protein on average and ranged from 5.3-35 pmol per mg of total protein. 
Crystallization. QNB-bound M2-T4L was concentrated to 20mg ml ' in decyl 
maltoside buffer in a volume of approximately 100 pl. A 10% stock solution of 
lauryl maltose neopentyl glycol detergent (MNG, Anatrace) with 100 mM NaCl 
and 20 mM HEPES pH 7.5 was then added to the protein to a final concentration 
of 1% (w/v) of MNG detergent. The sample was incubated for 1h on ice, then 
diluted to 1 ml in 0.1% MNG buffer and reconcentrated to 50mg ml * before 
reconstitution. The final volume of protein sample at this concentration was 
typically 20-30 kl. Protein was reconstituted in cubic phase by mixing with a 
1.5-fold weight excess of a 10:1 monoolein:cholesterol mix by the twin-syringe 
method”. Briefly, the protein and lipid were mixed by passage through coupled 
syringes 100 times either by hand or using a Gryphon LCP robot (Art Robbins 
Instruments). The reconstituted protein was dispensed using a modified ratchet 
device (Hamilton) or using the Gryphon LCP robot in 40 nl drops to either 24-well 
or 96-well glass sandwich plates and overlaid with 0.8 ul precipitant solution. A 
single crystallization lead was initially identified using an in-house screen and then 
optimized. Crystals for data collection were grown in 25-35% PEG 300, 100 mM 
ammonium phosphate, 2% 2-methyl-2,4-pentanediol, 100 mM HEPES pH 7.0- 
7.8. Crystals reached full size and were harvested after 3-4 days at 20 °C. Typical 
crystals are shown in Supplementary Fig. 7. 

Data collection and processing. Diffraction data were measured at the Advanced 
Photon Source beamlines 23 ID-B and 23 ID-D. Several hundred crystals were 
screened, and a final data set was compiled using diffraction wedges of typically 5 
degrees from the 23 most strongly diffracting crystals. Data reduction was per- 
formed using HKL2000°°. Diffraction quality was very heterogeneous, with some 
crystals diffracting to 2.3 A whereas others failed to diffract past 3.5 A. Among the 
best crystals, most diffracted to 3.0-2.5 A. Severe radiation damage and aniso- 
tropic diffraction resulted in low completeness in higher resolution shells. We 
report this structure to an overall resolution of 3.0 A. Despite the low completeness 
in high resolution bins, inclusion of these reflections significantly improved map 
quality. Highest shell <I>/<cI> is relatively low, in large part due to anisotropy 
of the diffraction. The final resolution cut-off was chosen on the basis of complete- 
ness and <I>/<oI> in the spherical highest shell, but analysis of average F/oF 
values along reciprocal space axes suggests resolution limits (based on average 
F/oF> 3) of 3.5, 2.9 and2.7A along a*, b* and c*, respectively. The real space 
c-axis is normal to the plane of the lipid membrane in the crystal. 

Structure solution and refinement. The structure was solved by molecular 
replacement using Phaser*’”** with the structure of the inactive B, adrenergic 
receptor and T4L used as search models (PDB accession 2RH1). The initial 
molecular replacement model was further fitted by rigid body refinement followed 
by simulated annealing and restrained refinement in Phenix”. Iterative manual 
rebuilding and refinement steps were performed with Coot and phenix.refine, 
respectively. Figures were prepared with PyMOL, and Ramachandran statistics 
were calculated with MolProbity. 
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Acetylcholine, the first neurotransmitter to be identified’, exerts 
many of its physiological actions via activation of a family of 
G-protein-coupled receptors (GPCRs) known as muscarinic 
acetylcholine receptors (mAChRs). Although the five mAChR 
subtypes (M1-M5) share a high degree of sequence homology, they 
show pronounced differences in G-protein coupling preference 
and the physiological responses they mediate” *. Unfortunately, 
despite decades of effort, no therapeutic agents endowed with clear 
mAChR subtype selectivity have been developed to exploit these 
differences**. We describe here the structure of the Gg/;;-coupled 
M3 mAChR (‘M3 receptor’, from rat) bound to the bronchodilator 
drug tiotropium and identify the binding mode for this clinically 
important drug. This structure, together with that of the G;/,- 
coupled M2 receptor’, offers possibilities for the design of 
mAChR subtype-selective ligands. Importantly, the M3 receptor 
structure allows a structural comparison between two members of 
a mammalian GPCR subfamily displaying different G-protein 
coupling selectivities. Furthermore, molecular dynamics simula- 
tions suggest that tiotropium binds transiently to an allosteric 
site en route to the binding pocket of both receptors. These simula- 
tions offer a structural view of an allosteric binding mode for an 
orthosteric GPCR ligand and provide additional opportunities for 
the design of ligands with different affinities or binding kinetics for 
different mAChR subtypes. Our findings not only offer insights 
into the structure and function of one of the most important GPCR 
families, but may also facilitate the design of improved therapeutics 
targeting these critical receptors. 

The mAChR family consists of five subtypes, M1-M5, which can be 
subdivided into two major classes (Fig. la). The M1, M3 and M5 
receptors show selectivity for G proteins of the G,/,; family (that is, 
G, and Gj), whereas the M2 and M4 receptors preferentially couple to 
Gijo-type G proteins (G; and G,)”*. The development of small molecule 
ligands that can selectively act on specific mAChR subtypes has proven 
extremely challenging, primarily owing to the high degree of sequence 
similarity in the transmembrane (TM) core of these receptors” *. More 
recently, considerable progress has been made in targeting drugs to 
non-classical (allosteric) binding sites of certain mAChR subtypes’. 

Within the mAChR family, the M3 subtype mediates many import- 
ant physiological functions, including smooth muscle contraction and 
glandular secretion****”°. Central M3 receptors have also been impli- 
cated in the regulation of food intake’, learning and memory”, and the 
proper development of the anterior pituitary gland’’. Selective drugs 
targeted at this receptor subtype may prove clinically useful***"°, and 
non-selective muscarinic ligands are already widely used in current 
practice. 

Owing to the profound physiological importance of the M3 receptor 
and its long-standing role as a model system for understanding GPCR 
function*”’, we used the T4 lysozyme (T4L) fusion protein strategy’* to 


obtain crystals of Rattus norvegicus M3 receptor-T4L fusion protein 
(Supplementary Fig. 1) by lipidic cubic phase crystallization. Diffraction 
data from more than 70 crystals were merged to create a data set to 3.4 A 
resolution and to solve the structure by molecular replacement. The M3 
receptor structure, together with that of the M2 receptor’, affords an 
opportunity to compare two closely related mammalian receptors with 
divergent G-protein coupling selectivities. 

The overall structure of the M3 receptor is similar to that of M2 
(Fig. 1b-d). Surprisingly, structural conservation includes intracellular 
loops (ICLs) 1 and 2, and extracellular loops (ECLs) 1-3, which share 
highly similar overall folds despite low sequence conservation (Fig. If). 
Like the M2 receptor, the M3 receptor exhibits unique mAChR features, 
including a large extracellular vestibule as part of an extended hydro- 
philic channel containing the orthosteric binding site (Fig. le). Also 
like M2, the M3 receptor features a pronounced outward bend at the 
extracellular end of TM4 (Fig. 1d; Supplementary Fig. 2b). This bend, 
not seen in any other GPCR family crystallized so far, is stabilized by a 
hydrogen bond from the Q207 (Q163 in M2) side chain to the L204 
backbone peptide carbonyl (Supplementary Fig. 2b). This bond is part 
of a polar interaction network involving four residues absolutely con- 
served within the mAChR family, suggesting that this unusual feature 
is important to mAChRs in general. Indeed, mutagenesis of Q207 in 
M3 impaired both ligand binding and receptor activation’. 

The M3 receptor was crystallized in complex with tiotropium 
(Spiriva), a potent muscarinic inverse agonist'*’* used clinically for 
the treatment of chronic obstructive pulmonary disease (COPD). The 
M2 receptor was crystallized in complex with R-(—)-3-quinuclidinyl 
benzilate (QNB) which, like tiotropium, is a non-subtype-selective 
mAChR blocker’*”*. The two ligands bind in remarkably similar poses 
(Fig. 2b), and it is likely that this pose represents a conserved binding 
mode for structurally similar anticholinergics. In the M3 receptor, as in 
M2, the ligand is deeply buried within the TM receptor core (Fig. 2a, d) 
and is covered by a lid comprising three conserved tyrosines— 
y148°°3, 506°"! and Y5297°? (Fig. 2a; superscripts indicate 
Ballesteros-Weinstein numbers'’). The ligand is almost completely 
occluded from solvent and engages in extensive hydrophobic contacts 
with the receptor. A pair of hydrogen bonds are formed from N507°°* 
to the ligand carbonyl and hydroxyl, while D147°* interacts with the 
ligand amine. 

Reflecting the difficulty in developing subtype-selective orthosteric 
ligands, the residues forming the orthosteric binding pocket are 
absolutely conserved among the five mAChR subtypes (Fig. If). 
However, this conservation at the amino acid level does not preclude 
the existence of differences in the three-dimensional architecture of the 
orthosteric site between the different mAChR subtypes. In fact, com- 
parison of the structures of the M3 and M2 receptor ligand binding 
sites reveals structural divergences that might be exploited in the 
development of subtype-selective ligands. 
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Figure 1 | Major structural features of the M3 receptor. a, Analysis of accessible surface for the M3 receptor bound to tiotropium (spheres) shows the 
muscarinic receptor sequences divides them into two classes. b, The overall receptor covering the ligand with a tyrosine lid (outlined in red). The surface of 
structure of the M3 receptor (green) is similar to that of the M2 receptor the receptor is shown in green and its interior in black. f, M3 receptor structure 
(orange). The M3-bound ligand, tiotropium, is shown as spheres coloured coloured by sequence conservation among the five mAChR subtypes. Poorly 


according to element, with carbon in yellow and oxygen in red.c, Comparison _ conserved regions are shown with larger backbone diameter. The orthosteric 
of the intracellular surfaces shows divergence in the cytoplasmic end of TM5. and allosteric sites are indicated in blue and red elliptical shaded areas, 


d, Comparison of the extracellular surfaces shows less deviation, with near respectively, and the ligand tiotropium is shown as spheres. 
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Figure 2 | Orthosteric binding sites of the M2 and M3 receptors. In all similar poses (bottom). c, There is a Phe (M2)/Leu (M3) sequence difference 
panels, the M3 receptor is shown green with its ligand tiotropium in yellow, between the M2 and M3 receptors near the binding site. d, This produces an 
while the M2 receptor and its ligand QNB are shown in orange and cyan, enlarged binding pocket in the M3 receptor, outlined in red and indicated with 
respectively. a, Tiotropium binding site in the M3 receptor. A 2F, — F, map an arrow. e, Displacements of M3 Y529’*° and D147°” are seen (black dashed 
contoured at 2o is shown as mesh. b, Chemical structures of ligands. A red lines). f, The displacement of 52979 may arise from a sequence difference at 
arrow indicates the tropane C3 atom used as a tracking landmark in Fig. 3. position 2.61 (Tyr 80 in M2 and Phe 124 in M3). 


Superimposing the receptor structures reveals that the two ligands adopt highly 
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One such difference derives from the replacement of Phe 181 in 
ECL2 of M2 with Leu225 in M3 (this residue is leucine in all 
mAChRs except M2). This creates a pocket in M3 not found in M2 
(Fig. 2c, d). A second difference is a 2.8 A shift of Tyr5297°° relative to 
the position of the corresponding M2 residue (Tyr 426; Fig. 2e). This 
feature may derive from a difference in the identity of the residue in 
position 2.61 (Phe 124 in M3 and Tyr 80 in M2; Fig. 2f). This residue 
interacts directly with TM7, influencing the position of this helix and 
the residues within it, including Tyr529’*°. Notably, the residue at 
position 2.61 is not a part of the orthosteric binding pocket, but is 
positioned near a probable allosteric binding site’. Because tiotropium 
and QNB are structurally similar but not identical, the observed bind- 
ing site differences must be interpreted with some degree of caution. 
However, site-directed mutagenesis studies with M1 and M3 receptors 
support the concept that the residue at position 2.61 plays a role in 
receptor activation’*”” and ligand binding selectivity”®. This site does 
not appear to play a role in determining antagonist dissociation rates, 
because mutation of M3 F*°' to tyrosine or of M2 Y~*' to phenylalanine 
had no effect on dissociation rates for [H]N-methyl scopolamine 
(PHJNMS) or [*H]QNB. 

Weused molecular dynamics simulations to characterize the pathway 
by which tiotropium binds to and dissociates from the M2 and M3 
receptors. Similar techniques have previously been shown to correctly 


Figure 3 | Molecular dynamics of ligand binding. Simulations suggest that 
the tiotropium binding/dissociation pathway for both receptors involves a 
metastable state in the extracellular vestibule. a, When tiotropium is pushed out 
of the binding pocket of M3, it pauses in the extracellular vestibule in the region 
outlined with a dashed circle. Spheres represent positions of the ligand’s C3 
tropane atom at successive points in time. The direction of motion is indicated 
by an arrow. b, When tiotropium is placed in solvent, it binds to the same site in 
the extracellular vestibule. Our simulations are insufficiently long for it to 
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predict crystallographic ligand binding poses and kinetics in studies of 
B-adrenergic receptors”. In both the M2 and M3 receptors, our simu- 
lations indicate that as tiotropium binds to or dissociates from the 
receptor, it pauses at an alternative binding site in the extracellular 
vestibule (Fig. 3, Supplementary Fig. 3). Intriguingly, this site corre- 
sponds to an allosteric site that has been previously identified by 
mutagenesis’, a finding consistent with pharmacological studies show- 
ing that orthosteric ligands can act as allosteric modulators at the M2 
receptor. Tiotropium adopts different preferred allosteric binding 
poses in M2 and M3 (Fig. 3d, Supplementary Fig. 4). These metastable 
binding poses, which appear independently in both binding and 
dissociation simulations, may represent the first structural view of a 
clinically used ‘orthosteric’ GPCR ligand binding to an experimentally 
validated allosteric site. Conceivably, therapeutic molecules could be 
rationally engineered to act independently as both allosteric and 
orthosteric ligands (in contrast to previously described bitopic ligands 
that bind at both orthosteric and allosteric sites simultaneously’). 
Tiotropium dissociates from M3 receptors more slowly than from 
M2 receptors, a phenomenon thought to provide clinically important 
‘kinetic selectivity’ of this drug for M3 receptors despite similar 
equilibrium binding affinities for both subtypes’*. In simulations with 
tiotropium bound, the portion of ECL2 nearest the binding pocket 
proved more mobile in M2 than in M3 (Supplementary Fig. 5), probably 


Free energy 


Unbound 


Extracellular 
vestibule 
Orthosteric site 


Binding coordinate 


proceed into the orthosteric binding pocket; the agonist acetylcholine (ACh), a 
much smaller molecule, bound spontaneously to the orthosteric site in similar 
simulations (Supplementary Methods). c, Schematic free-energy landscape for 
binding/dissociation. Differences between M2 and M3 are shown in orange and 
green, respectively, with the rest of the curve in black. d, Common binding poses 
for tiotropium in the extracellular vestibule of M2 (orange) and of M3 (green). 
Non-conserved residues that contact the ligand are shown as thin sticks. The 
location of the orthosteric site is indicated by tiotropium (as spheres). 
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owing to multiple sequence differences between the two receptor sub- 
types. This increased mobility disrupts a hydrophobic cluster involving 
a thiophene ring of tiotropium, the ECL2 residue Phe 181(M2)/ 
Leu 225(M3), and Tyr**’, facilitating movement of Phe 181/Leu 225 
away from the orthosteric site and rotation of Tyr*** towards TM4. 
In simulations of ligand dissociation, such motions clear a path for 
tiotropium’s egress from the orthosteric site to the extracellular 
vestibule. The increased mobility of ECL2 in M2 thus appears to facilitate 
tiotropium’s traversal of the largest energetic barrier on the binding/ 
dissociation pathway (Fig. 3c). Experimental measurements with 
wild-type and mutant receptors (M3 L225F and M2 F181L) suggest 
that the Leu 225/Phe 181 sequence difference alone is insufficient to 
explain the difference in off-rates (for practical reasons these measure- 
ments were performed with QNB rather than tiotropium; see Methods). 
One of the most interesting features of the M2 and M3 receptors is 
the fact that the two highly similar receptors display pronounced 
differences in G-protein coupling specificity. For this reason, the 
M2/M3 receptor pair has long served as an excellent model system 
to identify features contributing to the selectivity of coupling between 
GPCRs and G proteins’. As no simple sequence elements have been 
identified as general determinants of coupling specificity across GPCR 
families”, it is likely that recognition depends on features such as 
overall conformation in addition to specific inter-residue contacts. 
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Figure 4 | G-protein coupling specificity determinants. a, The M3 receptor 
shows displacement of TM5 relative to its position in M2, and a conserved 
tyrosine (M3 Tyr 250°°*) adopts different positions in the two receptors. Four 
TM6 residues near TM5 (AALS in M3, VTIL in M2; boxed) have been shown to 
be important coupling specificity determinants. b, ICL2 is also divergent 
between the two structures. Four residues previously implicated as specificity 
determinants” are shown, with residue numbers for M2 followed by M3. ¢, Plot 
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The M2 and M3 receptor structures show a significant difference in 
the position of the cytoplasmic end of TM5 and of ICL2 (Fig. 4a, b). 
The highly conserved tyrosine residue at position 5.58 (M3 Tyr250°°*, 
M2 Tyr206°°*) shows a clear deviation between the two receptors, 
pointing towards the core of the protein in M2, and away from the 
receptor towards the surrounding lipid bilayer in M3. Interestingly, 
mutagenesis studies have identified a tetrad of residues (“AALS in M3, 
‘VTIL’ in M2) located on the cytoplasmic end of TM6 that are critical 
in determining G-protein coupling selectivity**”*. In both structures, 
these residues interact directly with TM5 (Fig. 4a), and in the B, 
adrenergic receptor-G, complex” two of the four corresponding 
residues make contact with the carboxy-terminal helix of Ga,. M3 
Tyr 254° at the bottom of TMS also plays a role in activation of 
Gavi (ref. 28). In the M2 receptor structure, the corresponding residue 
(Ser 210°) is displaced by approximately 4 A relative to Tyr 254° in 
M3 (Fig. 4a). 

When we compared the position of TM5 in the M2 and M3 receptors 
to that in other GPCR structures, we found that it is M2-like in all G;/,- 
coupled receptors, whereas the two mammalian G,,;-coupled receptors 
solved to date exhibit another conformation (Fig. 4c, d). An important 
caveat here is that these structures have been solved using the T4L fusion 
strategy, and we cannot completely exclude the possibility that this 
approach perturbs the conformation of TM5 and TM6. However, in 
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of interhelical distances for crystallographically unique inactive GPCR 
structures published to date. Distances were measured between C, atoms of 
TMS residue 5.62 and T'M3 residue 3.54 (x-axis), and TM5 residue 5.62 and 
TM6 residue 6.37 (y-axis). GPCRs cluster by coupling specificity, although 
squid rhodopsin is an exception. GPCRs coupling preferentially to Gi, and 
those coupling to the homologous G protein G, cluster together. d, Structural 
alignment of mammalian G,,-coupled and Gg/;;-coupled receptor structures. 
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molecular dynamics simulations of M2 and M3 receptors without T4L, 
each of the receptors adopts a set of conformations that includes its own 
crystallographically observed conformation (Supplementary Fig. 6, 7). 
These simulations suggest that the observed conformations are unlikely 
to be artefacts of the crystallization methodology, though the crystal 
structures probably represent only one conformation among many 
adopted by the receptors in a biological context. 

The structure of the M3 receptor, together with that of the M2 
receptor’, offers a unique opportunity to directly compare the struc- 
tural properties of two members of a mammalian GPCR subfamily 
endowed with different G-protein coupling selectivities. Examination 
of the M3 structure has provided structural evidence of differences 
between ligand binding sites of mAChR subtypes that could be 
exploited for the design of more selective therapeutics. Moreover, 
computational studies have identified a pathway by which the 
COPD drug tiotropium may bind to and dissociate from the M3 
receptor, offering a structural view of an orthosteric GPCR ligand 
binding to an experimentally validated allosteric site. This information 
should facilitate the rational design of new muscarinic drugs exhibiting 
increased receptor subtype selectivity, potentially improving treatment 
for a wide variety of important clinical disorders. 


METHODS SUMMARY 


The M3 muscarinic receptor-T4 lysozyme fusion protein was expressed in Sf9 
insect cells and purified by nickel affinity chromatography followed by Flag 
antibody affinity chromatography and then size exclusion chromatography. It 
was crystallized using the lipidic cubic phase technique, and diffraction data were 
collected at the GM/CA-CAT beamline at the Advanced Photon Source at 
Argonne National Laboratory. The structure was solved by molecular replacement 
using merged data from 76 crystals. All-atom classical molecular dynamics (MD) 
simulations with explicitly represented lipids and water were performed using the 
CHARMM force field” on Anton”. Ligand-binding simulations included no 
artificial forces. Dissociation studies included a time-varying biasing term that 
gradually forces the ligand away from its crystallographic position, but not along 
any prespecified pathway or direction. Full details are provided in Methods. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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Expression and purification of M3 muscarinic receptor. The wild-type M3 
mACHhR contains several long, probably poorly ordered regions, including the 
extracellular amino-terminal domain and the third intracellular loop, making it a 
challenging candidate for crystallographic studies. To alleviate this problem, the M3 
receptor from R. norvegicus was modified to include a TEV protease recognition site 
in the N terminus and a hexahistidine tag at the carboxy terminus. Moreover, the 
third intracellular loop (residues 260-481) was replaced with T4 lysozyme residues 
1-161 in a manner described previously’’, with two different fusions tested. These 
modifications are shown in Supplementary Fig. 1, which also shows the final 
crystallization construct. 

The pharmacological properties of the construct were tested and compared to 
those of the wild-type receptor (Supplementary Fig. 8, Supplementary Table 1; see 
below for methods details). Both constructs showed almost identical affinity for 
antagonists, while the crystallization construct (M3-crys) showed somewhat 
higher affinity for the agonist ACh than the wild-type construct. A similar obser- 
vation has been noted previously in the B2 adrenergic receptor’. Studies with 
membranes prepared from transfected COS-7 cells showed that TEV cleavage 
of M3-crys (to remove most of the N-terminal tail) had no significant effect on 
ligand binding affinities (Supplementary Fig. 9). Moreover, the wild-type receptor 
and M3-crys, either cleaved with TEV or left uncleaved, showed very similar 
PHJQNB dissociation rate kinetics (Supplementary Fig. 10). As expected, the 
crystallization construct failed to stimulate agonist-dependent phosphoinositide 
hydrolysis in transfected COS cells (data not shown), probably because essential 
G-protein interacting regions in ICL3 were omitted from the construct and also 
because the T4 lysozyme fusion protein sterically blocks G-protein association. 

The crystallization construct was expressed in Sf9 cells using the baculovirus 
system in the presence of 1 {1M atropine. M3 receptors expressed in Sf9 cells are 
known to exhibit functional and pharmacological properties similar to M3 receptors 
expressed in mammalian cells*!. Infection was performed at 4 X 10° cells per ml and 
flasks were shaken at 27 °C for 60 h following infection. 

Cells were harvested by centrifugation, then lysed by osmotic shock in the 
presence of 1 11M tiotropium bromide (PharmaChem), which was present in all 
subsequent buffers. Receptor was extracted from cells using a Dounce homogenizer 
with a buffer of 0.75 M NaCl, 1% dodecyl maltoside (DDM), 0.03% cholesterol 
hemisuccinate (CHS), 30mM HEPES pH 7.5, and 30% glycerol. Iodoacetamide 
(2 mg ml ') was added to block reactive cysteines at this stage. Nickel-NTA agarose 
was added to the solubilized receptor without prior centrifugation, stirred for 2h, 
and then washed in batch with 100g spins for 5 min each. Washed resin was poured 
into a glass column, and receptor was eluted in 0.1% DDM, 0.03% CHS, 20 mM 
HEPES pH 7.5, 0.75 M NaCl and 250 mM imidazole. 

Nickel-NTA agarose resin-purified receptor was then loaded by gravity flow 

over anti-Flag M1 affinity resin. Following extensive washing, detergent was 
gradually exchanged over 1.5h into a buffer in which DDM was replaced with 
0.01% lauryl maltose neopentyl glycol (MNG), and the NaCl concentration was 
lowered to 100mM. MNG has been shown to be more effective at stabilizing 
muscarinic receptors than DDM”. Receptor was eluted with 0.2mg ml! Flag 
peptide and 5mM EDTA. TEV protease (1:10 w/w) was added and incubated 
with receptor for 1.5 h at room temperature to remove the flexible N-terminal tail. 
Receptor was then separated from TEV by size exclusion chromatography (SEC) 
on a Sephadex $200 column (GE Healthcare) in a buffer of 0.01% MNG, 0.001% 
CHS, 100 mM NaCl and 20 mM HEPES pH 7.5. Tiotropium was added to a final 
concentration of 101M following SEC. The resulting receptor preparation was 
pure and monomeric (Supplementary Fig. 11). Purification of unliganded M3 
receptor was also possible by this procedure, but the resulting preparation was 
polydisperse and unsuitable for crystallographic study. 
Crystallization and data collection. Purified M3 receptor was concentrated to 
60mgml ', then mixed with 1.5 parts by weight ofa 10:1 mix of monoolein with 
cholesterol (Sigma) using the two syringe reconstitution method”’. The resulting 
lipidic cubic phase mix was dispensed in 15 nl drops onto glass plates and overlaid 
with 600 nl precipitant solution using a Gryphon LCP robot (Art Robbins 
Instruments). Crystals grew after 2-3 days in precipitant solution consisting of 
27-38% PEG 300, 100mM HEPES pH 7.5, 1% (w/w) 1,2,3-heptanetriol, and 
100mM ammonium phosphate. Typical crystals are shown in Supplementary 
Fig. 12. 

Data collection was performed at Advanced Photon Source GM/CA-CAT 
beamlines 23ID-B and 23ID-D using a beam size of 5 or 10 tm for most crystals. 
Diffraction quality rapidly decayed following exposure, and wedges of typically 5° 
were collected and merged from 76 crystals using HKL2000™. Diffraction quality 
ranged from 3 to 4 A in most cases, with strong anisotropy evident in many frames. 
Most crystals tested showed evidence of epitaxial twinning, though in most cases one 
of the two twins dominated the observed diffraction pattern, allowing processing 
as a single crystal. A more extensive discussion of the twinning is given below. 
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Some contamination of diffraction measurements due to the twin-related reflec- 
tions was unavoidable, leading to slightly poorer merging statistics than is typical 
for data sets collected from many small crystals (Supplementary Table 2). Despite 
this, maps were generally of high quality and electron density was easily interpret- 
able (Supplementary Figs 13, 14), in part due to the availability of non-crystal- 
lographic symmetry. 

Analysis of <F>/<oF> along each of the three reciprocal space axes indicated 
that the diffraction was strong in two directions, and weak in the third direction, 
along the reciprocal space axis c* (Supplementary Fig. 15). Using <F>/<oF> 
greater than 3 as a guideline suggested a resolution cut-off of better than 3.2 A 
along a* and b*, and of 4.0 A along c*. We therefore applied an ellipsoidal trun- 
cation along these limits, and then applied an overall spherical truncation at 3.4 A 
due to low completeness in higher resolution shells. Fortunately, fourfold non- 
crystallographic symmetry (NCS) allowed for improved map quality with map 
sharpening followed by NCS averaging, largely alleviating the effects of anisotropic 
diffraction and epitaxial twinning to give highly interpretable maps 
(Supplementary Figs 13, 14) and allowing details of ligand recognition to be clearly 
resolved (Supplementary Table 3). 

The structure of the M3 receptor was solved using the structure of the M2 

muscarinic receptor’ as the search model in Phaser**. The model was improved 
through iterative refinement in Phenix” and manual rebuilding in Coot guided by 
both NCS averaged and unaveraged maps. NCS restraints were applied in initial 
refinement stages, and omitted in final refinement cycles to account for differences 
between NCS-related copies. The quality of the resulting structure was assessed 
using MolProbity”, and figures were prepared using PYMOL”. 
Epitaxial twinning. Crystals of the M3 receptor showed hallmarks of epitaxial 
twinning, such as mixed sharp and split spots, poor indexing, and many unpre- 
dicted reflections in some frames. In some cases diffraction from two distinct 
lattices was clearly visible, with a small fraction of reflections exactly superimposed 
from both lattices (Supplementary Fig. 16). In most cases one lattice dominated the 
diffraction pattern to such an extent that it could be easily processed as a single 
crystal. Intriguingly, the two indexing solutions were not equivalent cells but 
rather were two enantiomorphic P1 cells (Supplementary Table 2). 

As one of these two cells gave significantly better diffraction data than the other, 

data processing and refinement were only pursued in this case. Within the asym- 
metric unit, two layers of receptors and two layers of T4 lysozyme are present, but 
each of these four layers exhibits a different lattice packing (Supplementary Figs 
17, 18). The order in which these layers are stacked in the crystal defines a unique 
direction along c, the axis normal to the membrane plane. As P1 is a polar space 
group, the positive direction along c is uniquely defined, and the two possible 
orientations of the stacked layers of membrane relative to the positive direction 
along c distinguish the two twin crystal forms. 
Expression of M3 receptors in COS-7 cells, membrane preparation, and TEV 
treatment. COS-7 cells were cultured as described previously’. About 24h before 
transfections, ~1 X 10° cells were seeded into 100-mm dishes. Cells were trans- 
fected with 4 g per dish of receptor plasmid DNA using the Lipofectamine Plus 
kit (Invitrogen), according to the manufacturer’s instructions. The mammalian 
expression plasmid coding for the wild-type rat M3 receptor has been described 
previously“. The coding sequence of the modified M3 receptor construct used for 
crystallization studies (M3-crys; see Supplementary Fig. 8, Supplementary Table 1) 
was inserted into the pcDNA3.1(-) vector. Transfected cells were incubated with 
1 uM atropine for the last 24h of culture to increase receptor expression levels”. 
COS-7 cells were harvested ~48 h after transfections, and membranes were pre- 
pared as described” 

Membranes prepared from COS-7 cells transiently expressing M3-crys were 
resuspended in TEV protease digestion buffer (50 mM NaCl, 10 mM HEPES pH 
7.5 and 1mM EDTA) and incubated overnight with TEV protease (made in our 
laboratory, final concentration 1 1M) at 4 °C with rotation. Efficient removal of the 
N-terminal tail of M3-crys by TEV was confirmed by SDS-PAGE and immuno- 
blotting using a monoclonal anti-Flag antibody directed against the N terminus of 
M3-crys. TEV-treated membranes were resuspended in either buffer A (25 mM 
sodium phosphate and 5 mM MgCl, pH 7.4) for radioligand binding studies or in 
sodium potassium phosphate buffer (4 mM Na,HPO,, 1 mM KH3PO,, pH 7.4) for 
PH]QNB dissociation assays (see below). 

Radioligand binding studies. [*H]N-methylscopolamine ([*H]NMS) saturation 
and competition binding studies were carried out essentially as described prev- 
iously*’. In brief, membrane homogenates prepared from transfected COS-7 cells 
(~10-20 pig of membrane protein per tube) were incubated with the muscarinic 
antagonist/inverse agonist, [7H]NMS, for 3h at 22°C in 0.5 ml of binding buffer 
containing 25mM sodium phosphate and 5mM MgCl, (pH 7.4). In saturation 
binding assays, we employed six different [7H]NMS concentrations ranging from 
0.1 to 6nM. In competition binding assays, we studied the ability of tiotropium, 
atropine or acetylcholine to interfere with [*HJNMS (0.5nM) binding. 
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Incubations were carried out for 20 h in the case of tiotropium in order to achieve 
equilibrium binding” (3 h for all other ligands). Non-specific binding was assessed 
as binding remaining in the presence of 1 1M atropine. Binding reactions were 
terminated by rapid filtration over GF/C Brandel filters, followed by three washes 
(~4 ml per wash) with ice-cold distilled water. The amount of radioactivity that 
remained bound to the filters was determined by liquid scintillation spectrometry. 
Ligand binding data were analysed using the nonlinear curve-fitting program 
Prism 4.0 (GraphPad Software Inc.). 

Atropine sulphate and acetylcholine chloride were from Sigma-Aldrich. 
Tiotropium bromide was purchased from W&J PharmaChem, Inc. PHINMS 
(specific activity: 85.0 Cimmol ') was obtained from PerkinElmer Life Sciences. 
(PHIQNB dissociation rate assays. (HJQNB (PerkinElmer; specific activity, 
50.5Cimmol ') dissociation rate assays were carried out as described previ- 
ously*’. Measurements were carried out at 37 °C in a total volume of 620 ull using 
a buffer consisting of 4mM Na:HPO, and 1 mM KH>PO, (pH 7.4). Membranes 
prepared from transfected COS-7 cells (final protein concentration, 10 1g protein 
per ml) were prelabelled with 1nM CPHIQNB for 30 min. Dissociation of the 
labelled ligand was initiated by the addition of atropine (final concentration, 
3 1M). Incubations were terminated by filtration through GF/C Brandel fibre 
filters that had been pretreated with 0.1% polyethyleneimine, followed by two 
rinses with ice-cold distilled water. The amount of radioactivity that remained 
bound to the filters was determined by liquid scintillation spectrometry. 
Molecular dynamics. In all simulations, the receptor was embedded in a hydrated 
lipid bilayer with all atoms, including those in the lipids and water, represented 
explicitly. Simulations were performed on Anton”, a special-purpose computer 
designed to accelerate standard MD simulations by orders of magnitude. 
System set-up and simulation protocol. Simulations of the M2 receptor were 
based on the crystal structure of the QNB-M2 complex, and simulations of M3 
were based on the structure of the tiotropium-M3 complex (chain A). These 
crystal structures were determined using a T4 lysozyme (T4L) fusion strategy, 
in which intracellular loop 3 (ICL3) of each receptor was replaced by T4L; the 
TAL sequence was omitted in our simulations. Residues 6.31-6.33 near the intra- 
cellular end of TM6 were unresolved in the M3 crystal structure, and residues 
6.27-6.30 were resolved in an unstructured conformation packed against T4L. 
Residues 6.27-6.36 were modelled manually as a helical extension of TM6, with 
side chains then placed using Prime. Hydrogens were added to the crystal struc- 
tures using Maestro (Schrédinger LLC), as described in previous work“. All 
titratable residues were left in the dominant protonation state at pH 7.0, except 
for Asp 69°’ in M2 and Asp 1147” in M3, which were protonated. Asp 69°? and 
Asp 114”°° correspond to rhodopsin Asp 83”*°, which is protonated during the 
entire photocycle®. 

Prepared protein structures were inserted into an equilibrated POPC bilayer as 
described*®. Sodium and chloride ions were added to neutralize the net charge of 
the system and to create a 150 mM solution. 

Simulations of the M3 receptor initially measured 80 X 80 X 87 A and con- 
tained 163 lipid molecules, 26 sodium ions, 41 chloride ions and approximately 
9,897 water molecules, for a total of ~56,000 atoms. Simulations of the M2 
receptor initially measured 79 X 79 X 85A and contained 156 lipid molecules, 
24 sodium ions, 35 chloride ions and approximately 9,165 water molecules, for 
a total of ~53,000 atoms. To simulate M2 with tiotropium bound, we removed the 
co-crystallized ligand, QNB, and docked in tiotropium using Glide (Schrédinger 
LLC). 

All simulations were equilibrated using Anton in the NPT ensemble at 310 K 
(37 °C) and 1 bar with 5 kcal mol” ' A~* harmonic position restraints applied to all 
non-hydrogen atoms of the protein and the ligand (except for the tiotropium-M2 
complex, where the ligand was unrestrained); these restraints were tapered off 
linearly over 50 ns. All bond lengths to hydrogen atoms were constrained using 
M-SHAKE”’. A RESPA integrator“ was used with a time step of 2 fs, and long-range 
electrostatics were computed every 6 fs. Production simulations were initiated 
from the final snapshot of the corresponding equilibration runs, with velocities 
sampled from the Boltzmann distribution at 310K, using the same integration 
scheme, long-range electrostatics method, temperature and pressure. Van der 
Waals and short-range electrostatic interactions were cut off at 13.5 A and long- 
range electrostatic interactions were computed using the k-space Gaussian Split 
Ewald method” with a 32 X 32 X 32 grid, o = 3.33 A, and o, = 2.33 A. 
Spontaneous binding of tiotropium and acetylcholine. We performed simula- 
tions where tiotropium was placed arbitrarily in the bulk solvent (at least 40 A 
from the entrance to the extracellular vestibule) and allowed to diffuse freely until 
it associated spontaneously with the M2 or M3 receptor, following methodology as 
described*". In these simulations (Supplementary Table 4, conditions D and E), the 
co-crystallized ligand was removed and four tiotropium molecules were placed in 
the bulk solvent. A tiotropium molecule bound to the extracellular vestibule at 
least once in each simulation. In the longer simulations, tiotropium bound to and 


dissociated from the extracellular vestibule multiple times. Tiotropium assumed 
several different poses when bound to the extracellular vestibule of either M2 or 
M3 (Supplementary Fig. 4). Tiotropium never entered the orthosteric binding 
pocket, presumably because the simulations were not of sufficient length. 

The fact that tiotropium associated with and dissociated from the vestibule 
multiple times, but did not enter the binding pocket, suggests that tiotropium 
must traverse a larger energetic barrier to enter the binding pocket of M2 or M3 
from the extracellular vestibule than to enter the vestibule from bulk solvent. This 
contrasts with earlier simulations on alprenolol binding to the B,-adrenergic 
receptor, in which the largest energetic barrier (by a small margin) was between 
the bulk solvent and the extracellular vestibule’. This difference probably reflects 
the fact that ligands must pass through a much tighter passageway to enter the 
binding pocket of the M2 and M3 receptors from the vestibule than is the case for 
the B-adrenergic receptor. Tiotropium lost the majority of its hydration shell as it 
entered the vestibule (Supplementary Fig. 19), as observed previously for ligands 
binding to B-adrenergic receptors”. 

We followed a similar protocol in a simulation of the M3 receptor in the 

presence of the agonist ACh, a smaller molecule which might be expected to bind 
faster (Supplementary Table 4, condition F). Indeed, an ACh molecule bound in 
the orthosteric binding pocket after 9.5 ,1s and remained there for the remainder of 
the 25-us simulation. Although ACh quickly passed through the extracellular 
vestibule en route to the binding pocket, it did not exhibit metastable binding in 
the vestibule. ACh exhibited significant mobility in the binding pocket, probably 
reflecting the low affinity of the crystallized inactive state for agonists. 
Forced dissociation of tiotropium. To identify the entire binding/dissociation 
pathway, we ‘pushed’ tiotropium out of the binding pocket of both the M2 and M3 
receptors”’*'. Production simulations were initiated from configurations of the 
corresponding unbiased trajectory. These simulations employed a time-dependent 
harmonic biasing potential, U(#): 


U(t) = V2 kd — do(t)) 


where t is time, k is a force constant in units of kcal mol! A~?, d is the distance 
between the centre-of-mass of the heavy atoms of tiotropium and the centre-of- 
mass of the protein Cx atoms, and dp(t) varied linearly over 1.0 1s, from 9.6 A to 
33 A for M2 and from 8.6 A to 32 A for M3. This biasing term does not impose 
any preferred direction of ligand exit. We performed seven such simulations for 
each of M2 and M3, with k = 5, starting from configurations extracted from the 
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Outside the box 


An industrial doctorate could help European students to break 
out of academia, but applied science is not for everyone. 


BY QUIRIN SCHIERMEIER 


hristian Hove Rasmussen isn’t worried 
( about getting a job. He is a PhD student 

at the Technical University of Denmark 
in Lyngby, but he is also an employee of Novo 
Nordisk, a pharmaceutical company based in 
Bagsveerd, where he is doing doctoral research 
on the biological mechanisms that govern the 
absorption of diabetes therapies, such as insulin, 
in the fat layer beneath the skin. His fixed-term 
contract expires in 2013, but Rasmussen is opti- 
mistic that his work experience will help him to 
secure a permanent position in industry. 

“Tm doing quite basic science, but it is target- 
oriented and I have to think about how it can 
be translated into medicine,” he says. “I think 
I can achieve a lot here.” Rasmussen's research 
— like that of around 60 other PhD candidates 
at Novo Nordisk — is funded by the Danish 
Industrial PhD Programme, a scheme set up 
by the Danish Agency for Science, Technology 
and Innovation (DASTI) in the 1970s. Last year, 
DASTI approved 116 industrial PhD projects, 
each funded by up to 882,000 Danish kroner 
(US$156,000). The host company gets a wage 
subsidy of 14,500 kroner per month for three 
years, and the university gets up to 360,000 kro- 
ner to last for the duration of the project. 

Industrial PhD programmes are start- 
ing across Europe. Some are structured, and 
include university coursework components. 
Others are more informal. In all cases, the 
commercial and industrial aspects of the 
research are overseen by company experts 
who take part in PhD supervision. If all goes 
well, the benefits are mutual: doctoral students 
develop an in-depth understanding of busi- 
ness, which facilitates employment; and their 
skills and discoveries help the company. But 
students must be willing to engage in applied 
research that conforms to industry needs. 


INDUSTRIAL EDUCATION 

Students have long interacted with industry 
outside ‘institutionalized’ PhD schemes, says 
Lidia Borrell-Damian, head of research and 
innovation at the European University Asso- 
ciation in Brussels. “It is not uncommon that 
during some point of a PhD a company is 
involved, in particular in science, technology, 
engineering and economics.’ Although explicit 
co-mentoring and co-funding have been avail- 
able in the United States for about 60 years, 
they are only now becoming widespread and 
accepted in Europe. 

In the United Kingdom, for example, 
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> the Engineering and Physical Sciences 
Research Council (EPSRC) runs 26 indus- 
trial doctorate centres, each of which recruits 
a dozen or so students from different back- 
grounds each year. In addition, the EPSRC’s 
industrial CASE programme provides support 
for individual PhD projects arranged between 
a company and an academic partner. 

In France, around 
10,000 science and 
engineering gradu- 
ates have completed a 
PhD under the Indus- 
trial Arrangements 
for Training Through 
Research (CIFRE) 
scheme, funded by 
the National Asso- 
ciation for Technical 
Research since 1981. 
The scheme is aimed 


mainly at students “I’m doing quite 
from the Ecole Poly- basic science, 
technique near Paris, but it is target- 
and companies post oriented. I think 
approved proposals Icanachievea 
on the university's lot here.” 
website. Interested = Christian Hove 
students can contact Rasmussen 


industrial partners to 
negotiate details and plan their dissertations. 

The European Commission (EC) last year 
launched a €20-million (US$26-million) 
industrial PhD initiative as part of its Marie 
Curie Actions funding programme, which 
promotes mobility among young scientists. 
Approximately 100 European Industrial Doc- 
torates will be funded under the pilot scheme, 
and the initiative is to become a permanent 
part of the European Horizon 2020 research- 
funding framework from 2014. Companies in 
the European Union or associated countries 
can submit research proposals without a spe- 
cific PhD candidate in mind, but must have a 
partner university in a different eligible nation. 

The structures of initiatives differ. Often, 
candidates spend 50% of their time or less at 
the university, and the rest at the company. 
Students at the ESPRC’s doctorate take uni- 
versity courses in cohorts, accounting for 25% 
of their PhDs, but do placements separately. 
For the CASE and EC programmes, students 
deal almost exclusively in company research, 
although they ultimately have to defend their 
theses at the university. 

The match-making processes also vary. In 
Denmark, students first need to find a com- 
pany and agree on a PhD project. Then they, 
along with their university advisers, submit 
applications to DASTI. In CIFRE and the EC 
scheme, companies submit PhD research pro- 
posals first, then recruit students. The EPSRC’s 
doctorate centres recruit students — accepting 
about 1 in every 20 applicants — and guide 
the match-making. Candidates for an EPSRC 
engineering doctorate in biopharmaceutical 


process development at Newcastle University, 
UK, have teamed up with companies in Brit- 
ain and abroad, including GlaxoSmithKline 
and AstraZeneca, both based in London; Uni- 
lever in Rotterdam, the Netherlands; Procter & 
Gamble in Cincinnati, Ohio; and Heineken in 
Amsterdam. Elaine Martin, a chemical engineer 
at Newcastle who oversees the programme, says 
that the scheme has proved so beneficial that 
many companies return to ask for fresh talent. 


OPENING DOORS 
Finding an industrial partner is often a chal- 
lenge, says Jane Thomsen, head of DASTT’s 
Industrial PhD Programme. “Students should 
have a very concrete research idea before they 
contact a company,’ she says. “Mere interest 
in some kind of collaboration is not enough” 
An internship or company placement often 
leads to a job or a more profound collabora- 
tion — Rasmussen, for example, did his mas- 
ter’s research at Novo Nordisk before his PhD. 

Similarly, Martina Hitzbleck became inter- 
ested in industrial research after a three- 
month college internship with IBM’s Almaden 
research centre in San Jose, California, in 2008. 
She learned that Emmanuel Delamarche, a sci- 
entist at IBM Zurich, was doing high-profile 
research on biosensor design — the subject of 
Hitzbleck’s master’s thesis — and she sent him 
her CV. After an interview and a presentation, 
Delamarche offered Hitzbleck a PhD project. 
He became her industrial supervisor, and Hitz- 
bleck found an academic mentor at the Swiss 
Federal Institute of Technology in Zurich. She 
spends 90% of her time with IBM. “T get all the 
support here that I could dream of, she says. 
“Whenever I have a problem, there is an expert 
on whose door I can knock.” 

There are currently 36 PhD projects under 


~~ 


way at IBM Zurich, mostly informal arrange- 
ments rather than part ofa programme. Oliver 
Ottow, head of human resources and university 
relations at the lab, says that the match-making 
generally takes place at scientific conferences 
and graduate recruitment fairs, or through 
IBM’s extended network of university contacts. 
“If students can convince us that their topics 
make a difference we'll find a home for them,” 
he says. “We just want to make sure they’re 
highly motivated and really very good” In prin- 
ciple, IBM gets research results and, possibly, a 
skilled future staff scientist. Students become 
familiar with applied, goal-oriented research, 
and learn about careers in industry, from patent 
law to media communication. 


REASONABLE PRECAUTIONS 

Students involved in informal collaborations 
should insist on a written agreement about 
the terms and conditions of the partnership, 
signed by themselves and all supervisors, says 
Borrell-Damian. “There is still a delicate bal- 
ance between the respective interests of stu- 
dents, university supervisors and companies,” 
she says. “A written agreement on all parties’ 
rights and duties is therefore essential?” Such 
agreements should specify topics and goals 
of the research, the required division of time 
and how intellectual-property rights will be 
assigned, she says. Company research depart- 
ments will usually own the intellectual prop- 
erty, but students should insist on the right to 
publish their findings. In general, universities 
still administer final exams, even if students are 
full-time company employees. 

There has been little research into the career 
paths of industrial PhD holders, but the few sur- 
veys that do exist suggest that graduates tend 
to stay in industry, usually in research. Even so, 


A college internship led Martina Hitzbleck to do a PhD at the IBM r 
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esearch centre in Zurich, Switzerland. 


IBM RESEARCH ZURICH 


the door to academia is not necessarily closed 
(see Nature 466, 402-403; 2010). Several former 
IBM researchers have moved on to become uni- 
versity faculty members, helped by the prestige 
of working in a high-profile industry lab. 

Some academics fear that industrial PhD stu- 
dents may not acquire the full set of skills and 
knowledge required for independent scientific 
research in academia, ranging from methods to 
research ethics. “Academia and industry have 
fundamentally different roles and it is not help- 
ful if they imitate each other,’ says Peter Blochl, 
a theoretical physicist at Clausthal University 
of Technology in Germany. He worked for ten 
years at IBM’s research laboratory in Rusch- 
likon, Switzerland, before moving to academia 
in 2000. Academia’s mission is pre-competi- 
tive research and student education, he says. 
“Companies can tap into this knowledge base 
to develop innovative products, but I see little 
purpose for a PhD in industry.’ Joint projects, 
consultantships and sabbaticals are more pro- 
ductive, says Bléchl. 

In general, students must be prepared for 
companies to tweak a research proposal to 
strengthen its commercial potential, says 
Thomsen. To avoid misunderstandings, they 
should make sure from the beginning that they 
are willing to accept such input. Early inter- 
views with company researchers should help, 
but subsequent problems are best addressed 
with the help of the supervisor. In structured 
programmes, the funding agency will also 
review any complaints, and may intervene. 

In the Danish scheme, says Thomsen, com- 
plaints are rare. If serious problems do arise — 
for example, if a host company goes bankrupt, 
or a PhD student is asked to do experiments or 
tasks unrelated to their project — the agency 
will try to mediate and, if necessary, demand 
the return of subsidies. 

Concerns that industry is in it mainly for 
the cheap labour, and that projects lack scien- 
tific depth, are unfounded, says Martin. “All 
companies we're working with — even the 
manufacturing sites — have a serious interest 
in supporting genuine research.” 

But as industrial PhDs become more com- 
mon, some do worry about exploitation of 
students and ‘over-industrialization of higher 
education. Eurodoc in Brussels, which lobbies 
for the rights of PhD candidates, warns that 
early-career research should be a free intellec- 
tual endeavour and not subject to the needs of 
business and industry. “Vanguard industrial 
PhD programmes are an opportunity to do 
what we all want to do — get work,” says Greg 
DeCuir, a dramatic-arts PhD student at the 
University of Arts in Belgrade, and a member 
of Eurodoc’s career-development group. “But 
we are not apprentices. If you feel that your sci- 
entific creativity is compromised, there should 
be aconcern.” = 


Quirin Schiermeier is Natures Germany 
correspondent. 
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TURNING POINT 
Christopher 


Christopher Wilson, a physicist at Chalmers 
University of Technology in Gothenburg, 
Sweden, led one of Physics World’ 2011 
‘breakthrough experiments: he and his 
team proved that a vacuum, rather than 
being completely empty, contains detectable 
virtual particles. He explains his motivation 
for taking a working sabbatical at a 
biotechnology start-up in California. 


You did your undergraduate degree at the 
Massachusetts Institute of Technology 

(MIT) in Cambridge. How did this affect your 
career? 

I was able to attend MIT after I won a 
US Naval Reserve Officers Training Corps 
scholarship, so I was expected to go into the 
navy afterwards. But I realized while I was 
at MIT, which is a very intense place, that I 
would rather do science. Between the second 
and third years of my degree, I notified the 
navy that I didn’t want to join. They could 
have drafted me, but they allowed me the 
option of paying back the scholarship money, 
which I’ve been doing ever since. 


How has your move to Chalmers influenced 
your research? 

I worked at Yale University in New Haven, 
Connecticut, for two years and then moved 
to Chalmers to work on a quantum comput- 
ing project, ostensibly for a year. ’'ve now 
been there for seven years. In Sweden, the 
work dynamic is hierarchical, like a company. 
There is a top professor who has several pro- 
fessors at different levels working under him 
or her — and younger researchers work their 
way up. It's a good system if you have a good 
boss. It gave me more time and freedom to 
get this one big experiment to work than I 
would have had in the United States. 


Describe your breakthrough experiment. 

When I got to Chalmers in 2004, my team 
started work on superconducting circuits 
for quantum computing. Around 2007, we 
realized that the work could allow us to meas- 
ure the virtual photons inside a vacuum. These 
virtual photons are generated and annihilated 
in pairs. About 40 years ago, it was suggested 
that a mirror moving near the speed of light 
could capture some of these photons. The 
effect had never been observed, because it is 
very hard to move a massive object that fast. 
We made an electronic ‘mirror’ that we could 
effectively move at one-quarter of the speed 
of light using magnetic fields. This allowed 
us to separate the pairs, stopping them from 
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annihilating and turning them into real pho- 
tons that we could observe (C. M. Wilson et al. 
Nature 479, 376-379; 2011). 


Could the media attention have a career 
benefit? 

It certainly helps to put the paper in a certain 
light, especially for people outside our physics 
sub-field. For example, when applying for 
jobs, you are evaluated by a whole depart- 
ment. It can be difficult even for other types 
of physicists to evaluate the details of papers. 


Why did you choose to take a sabbatical year 
at a start-up biotechnology company? 

Last July, I was promoted to associate pro- 
fessor, which is tenured at Chalmers. In the 
US system, it is typical to take a sabbatical 
after getting tenure. Sweden doesn't follow 
the same timing, nor does the university 
pay academics to go on sabbatical, but I had 
planned to do it. [happened to see an ad on 
a job-posting site from a start-up company 
working on biomedical devices and prote- 
omics. They needed someone skilled in 
algorithms and advanced statistical tools to 
analyse the enormous amount of data being 
generated about proteins, and I liked the peo- 
ple involved. It has turned out to be a good fit. 


What kind of career impact do you expect the 
sabbatical to have? 

I really wanted to do something to diversify 
my skills and develop some research lines 
that were completely my own — which can 
bea struggle in Europe’ hierarchical system. 
I want to see if I can contribute algorithms 
to the field of proteomics. It would be pure 
hubris to think I could jump into biology, but 
I would like to find collaborators and see if I 
can develop a new aspect of my research. = 


INTERVIEW BY VIRGINIA GEWIN 
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of physicists to evaluate the details of papers. 


Why did you choose to take a sabbatical year 
at a start-up biotechnology company? 

Last July, I was promoted to associate pro- 
fessor, which is tenured at Chalmers. In the 
US system, it is typical to take a sabbatical 
after getting tenure. Sweden doesn't follow 
the same timing, nor does the university 
pay academics to go on sabbatical, but I had 
planned to do it. [happened to see an ad on 
a job-posting site from a start-up company 
working on biomedical devices and prote- 
omics. They needed someone skilled in 
algorithms and advanced statistical tools to 
analyse the enormous amount of data being 
generated about proteins, and I liked the peo- 
ple involved. It has turned out to be a good fit. 


What kind of career impact do you expect the 
sabbatical to have? 

I really wanted to do something to diversify 
my skills and develop some research lines 
that were completely my own — which can 
bea struggle in Europe’ hierarchical system. 
I want to see if I can contribute algorithms 
to the field of proteomics. It would be pure 
hubris to think I could jump into biology, but 
I would like to find collaborators and see if I 
can develop a new aspect of my research. = 
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Uta SCIENCE FICTION 


GHOST IN THE MACHINE 


BY GRACE TANG 


make out the sound of Katie’s breath- 

ing. The first lines of light streamed in 
through the blinds, illuminating her toes. 
They crawled up her body, making their 
steady way up the folds of the covers, eventu- 
ally touching her face. She squeezed her eyes 
tight and groaned as the light rudely pierced 
her lids. Finally giving in, she rubbed the 
sleep from her eyes and looked back at me. 

“Good morning.” 

“Good morning :)” 

Katie rolled off her side of the bed, some- 
how managing to look beautiful while 
stumbling to the bathroom in her morn- 
ing stupor. I would have jumped into the 
shower with her, but God knows those days 
are behind me. 

The tap squeaked shut. Steam fogged up 
my vision as she emerged. It cleared in time 
for me to see her towel fall to her feet as she 
picked out her clothes for the day. 

“How did you sleep last night?” 

I didn’ tell her that I rarely slept any more. 
When I sleep, I dream. The air outside our 
house is crisp, filled with the shrill song of 
finches hidden in the canopy above us. You 
dont really notice them until they stop. I let 
go of Katie's hand and tell her to be quiet — I 
think I hear something. I walk ahead, careful 
not to make any noise. Then I hear a shout 
from one of the men in my squad — his 
scream is cut off by a gunshot. I kick up dust 
as I run, shouting at the top of my lungs, half 
to warn the rest about the ambush, half to 
drown out the sounds of gunfire at our backs. 
An explosion, and then pain. Blinding pain. 

“T slept well. You?” 

She took a while to check the monitor for 
my reply. 

“Like a baby:” 

“What are you doing at work today?” 

She was walking to the kitchen. There was 
another monitor there. I waited impatiently 
as she made coffee before checking to see 
if Td said anything. I was the result of mil- 
lions of dollars of research and they couldn't 
install text-to-voice... 

“You know, same old.” 

Small talk. I guess it beat the silence when 
she was away. 

“Oh, Brandon is com- 


I f I listened very carefully, I could barely 


ing bylater.Tocheckon NATURE.COM 

you.” Follow Futures on 
“Brandon?” Facebook at: 
“Doctor Johnson.” go.nature.com/mtoodm 


Computer love. 


Were they on first-name basis now? 

“Good that he’s coming. I’ve been having 
gaps in my playback” 

“Really?” Katie seemed fascinated by her 
coffee mug. She put it in the sink. 

“I should go now, gonna be late.” 

I watched her leave. An advantage of being 
like this was that my post-coma visual mem- 
ory was literally photographic. I spent the 
rest of the day going through old memory 
so that I could report the problem precisely 
to Dr Johnson. 

I went back to the day I was restored. Back 
then I had been disorientated and confused, 
I hardly noticed or cared about the details 


of my surroundings. But now I observed Dr 
Johnson as he talked to Katie — he was wear- 
ing an outfit that probably cost my entire pay 
cheque back when I was still in the military. 

“Thank you Doctor, you have no idea how 
grateful I am,’ Katie's voice cracked despite 
her best efforts. 

“Let me reiterate that you cannot let any- 
one know about this,’ 

Doctor Johnson put a hand on Katie’s 
shoulder. I couldn't tell if it was a sign of 
dominance or concern. 

“,.. or else everyone will be clamouring for 
their consciousness to be preserved electron- 
ically, you must understand ...” 

Katie nodded, no longer able to speak. 

“To the rest of the world, Evan is dead.” 

I went through each of the next 246 days 
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in my memory banks. I knew that they were 
just memories, but it was painful watching 
Katie as she struggled through the first few 
months of having me in this form. Around 
day 182 she finally stopped crying. That’s 
when the memory gaps started. Perhaps she 
hadn't stopped, and I was just consciously 
trying to forget... 

The door clicked open. Had eight hours 
passed already? Katie entered, followed by 
Dr (Brandon) Johnson. 

“I don't feel comfortable doing this in 
front of him ...” 

“Come on, you know we can just erase it 
later” 

“Katie?” 

He took off his leather shoes, placing them 
on the shoe rack without looking, as if hed 
done this every day of his life, while he took 
Katie in his arms. 

I understood now why they had not given 
mea voice. Katie resisted his grasp as they 
moved up to the bedroom. But she did not 
resist much. 

“KATIE” 

Brandon pushed my wife onto my bed, 
and tossed his shirt onto my camera. 

I tried not to listen. An eternity passed 
before he came back into view. 

“ARE YOU DONE YET?” 

He had the gall to laugh as he read my 
speech log. 

“Sorry, Evan? 

He connected his laptop to my port and 
typed. It’s funny how panic still feels the 
same, even though I no longer have adrenal 
glands. 

“DONT” 

“You know, you stop using punctuation 
when you're emotional. I should install auto- 
correct for you, don't you think?” 

Behind him, I saw Katie with the covers 
pulled up to her chest. She looked tired. 
Perhaps tired of having a husband who was 
nothing more than a ghost in a machine; 
who could not offer her human touch; whose 
entire repertoire of expression was limited to 
95 printable ASCTI characters. 

“Seeya; Brandon hit the return key. 

I must have fallen asleep, because I woke 
from the same dream I have every time. The 
sun had not come up yet. I watched Katie as 
she slept. m 


Grace Tang is a graduate student in 
psychology at Stanford University. Writing 
short stories is one of her favourite forms of 
structured procrastination. 
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Geometry and scale in species-area 


relationships 


ARISING FROM F. He & S. P. Hubbell Nature 473, 368-371 (2011) 


He and Hubbell developed a sampling theory for the species—area 
relationship (SAR) and the endemics-area relationship (EAR)'. 
They argued that the number of extinctions after habitat loss is 


Figure 1 | The outward EAR and the inward EAR. a, The outward EAR is 
calculated by counting the number of the endemic species to a rectangle from 
the centre to the periphery. b, The inward EAR is calculated by counting the 
number of endemics to an outer ring from the periphery to the centre. It is the 
inward EAR that replicates the geometry of the backward SAR. 
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described by the EAR and that extinction rates in previous studies 
are overestimates because the EAR is always lower than the SAR. Here 
we show that their conclusion is not general and depends on the 
geometry of habitat destruction and the scale of the SAR. We also 
question their critique of the Millennium Ecosystem Assessment 
estimates, as those estimates are not dependent on the SAR only, 
although important uncertainties remain due to other methodological 
issues. 

In several studies of extinction rates**, the proportion of extinc- 
tions after a habitat loss of area a from a total area A has been esti- 
mated from the power-law model of the SAR, Ssar(A) = cA’, as: 


Vata Ssar(A) —Ssar(A—a) = (1 “ (1) 


Ssar(A) A 


He and Hubbell call this method the backward SAR, as it uses esti- 
mates from the SAR in a backward way (from large to small areas) of 
how the SAR is constructed’. They argue that, instead, the number of 
extinctions is given by the proportion of endemics in a relative to A, 
which can be approximated by 


Figure 2 | The influence of scale and geometry on 
the EAR and the SAR. a, c, The graphs compare 
the outward EAR and the inward EAR with the 
backward SAR model (Asap) and the forward EAR 
model (JR) fitted to the data of each plot. Points 
correspond to the value of the EAR for each area 
size, sampled as in Fig. 1. The z value for Jzar 
comes from He and Hubbell’, whereas the z value 
for the As,r comes from the fit of the power law to 
the linear region of the SAR. b, d, Fit of the power- 
law SAR (Sgag) to the data on a log-log scale. Each 
point corresponds to the average number of species 
for randomly placed rectangles with a given area 
size. The SAR sampled from the centre to the 
periphery (Fig. 1a) gives similar z values 

(Zgcr = 0.1265 and Zyasuni = 0.0625). All z values 
were obtained by nonlinear least squares. The 
dashed vertical line marks the minimum area 
included in the fit. The top plots are for the tree and 
shrub species in the 50 ha plot in Barro Colorado 
Island (BCI), Panama, whereas the bottom plots 
are for the 50 ha plot in Yasuni, Ecuador. 
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VYear(4) = 1— (1 = “) : (2) 


where z’ is fit from the EAR and is always lower than the z from the 
SAR. The EAR is built in a forward fashion, counting the endemic 
species in progressively larger areas. 

It is uncontroversial that the species that go extinct immediately 
after habitat destruction are the endemic species to the area 
removed’®. However, both Agar and Asap describe the proportion 
of endemics in an area a, although of different geometry”. If destruc- 
tion starts from the centre of the patch (Fig. 1a), then Agar describes 
the number of extinctions because it approximates the proportion of 
endemics in progressively larger rectangles, the outward EAR 
(Fig. 2a). In contrast, if destruction occurs in the periphery 
(Fig. 1b), it is Asap that describes extinctions because it approximates 
the number of endemics in outer rings towards the centre of the plot, 
the inward EAR (Fig. 2a). This happens because only the inward EAR 
backtracks the geometry of how the SAR is built. This backtracking is 
exact if the SAR is built as in Fig. 1a, or approximate if the SAR is built 
from sampling several rectangles for each area size, but the z values of 
both methods are almost the same (Fig. 2). 

Note that, depending on the spatial structure of the distribution of 
species in the plot, the outward EAR may be similar to the inward EAR 
(Fig. 2c), but Asar is always a good approximation of the inward EAR 
as long as the SAR data points fit the power law. This fit depends on 
the scale of the SAR. Several studies have shown that at very small 
scales the SAR is curvilinear in a log-log scale’, as can be observed in 
the Barro Colorado Island and Yasuni plots (Fig. 2b, d). Therefore, the 
z of the SAR must be calculated for the linear region that is relevant for 
the extinction projections. 

There are many other sources of uncertainties in estimating future 
extinction rates. For instance, both the SAR and EAR project that all 
species go extinct after all native habitat is lost, ignoring that many 
species persist in human-modified habitats. The countryside SAR 
addresses this problem by tracking the number of species with 
similar habitat affinities in multiple habitats*, Another open 
question is what type of SAR better describes long-term extinctions 
after habitat loss. After a first stage of extinction of endemics, 
described by the EAR or the backward SAR, many species that still 
occur in the landscape will go extinct because the habitat left for 
them is smaller then their minimum required habitat size®. In this 
case, it has been proposed that future extinction rates are better 
described by the island SAR (built from counting the number of 
species in different islands)’. 

The Millennium Ecosystem Assessment drew in a wide range of 
extinction projections to identify the envelope of those uncertainties’. 
The SAR projections** were consistent with estimates from other 
methods, such as assessing the extinction risk of currently threatened 


species'®"'. In 2010 there was a revised assessment with more recent 
global extinction projections”, in which SAR-based projections again 
had a limited role, and new approaches such as the overlap of species 
ranges with habitat loss'’, ecophysiological models" and the correla- 
tion between elevational range and extinction risk, were included’. 
The range of uncertainty across models and scenarios was close to 
three orders of magnitude, compared to which the uncertainty now 
identified by He and Hubbell’ is negligible. In all cases models and 
scenarios supported the Millennium Ecosystem Assessment conclu- 
sions that biodiversity will continue to decline, and in most cases at 
increasing rates relatively to the recent past. 
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Extinction and climate change 


ARISING FROM F. He & S. P. Hubbell Nature 473, 368-371 (2011) 


Statistical relationships between habitat area and the number of 
species observed (species—area relationships, SARs) are sometimes 
used to assess extinction risks following habitat destruction or loss 
of climatic suitability. He and Hubbell’ argue that the numbers of 
species confined to—rather than observed in—different areas 
(endemics-—area relationships, EARs) should be used instead of 
SARs, and that SAR-based extinction estimates in the literature 
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are too high. We suggest that He and Hubbell’s SAR estimates are 
biased, that the empirical data they use are not appropriate to 
calculate extinction risks, and that their statements about extinction 
risks from climate change? do not take into account non-SAR-based 
estimates or recent observations. Species have already responded to 
climate change in a manner consistent with high future extinction 
risks. 
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Most of He and Hubbell’s results involved analysis of the number of 
tree species in 0.2ha and successively larger subplots within forest 
stands of 20-50 ha. By only counting the tree stems present in a plot 
(rather than canopies), they underestimate the true number of species 
present in small subplots. This artefact exaggerates SAR slopes when 
subplots smaller than ~2.5 ha are included’. 

We suggest that the data He and Hubbell’ use are not appropriate 
to calculate SAR or EAR slopes that are relevant to extinction. To 
calculate extinction risks, it is necessary to consider how many 
species might be lost if a habitat becomes isolated; however, He 
and Hubbell used data for forest plots that are surrounded by more 
forest, and for bird distributional cells that are surrounded by 
other land where birds also live. He and Hubbell’ consider the 
instantaneous presence of species in sample plots within contiguous 
areas, not the expected long-term persistence of species if these 
habitats were isolated. On average, 31 species of birds bred each year 
in Eastern Wood in England (instantaneous number), but only 16 
species bred in every one of 25 years (persistent species)*. Were this 
woodland completely isolated from other breeding habitats, the 
number of species would about halve in 25 years, resulting in much 
steeper SAR slopes. It is not known whether SAR and EAR estimates 
would steepen equally or converge for true isolates, so He and 
Hubbell’s' main conclusion that SARs overestimate extinction 
remains unsubstantiated. 

He and Hubbell’ consider that previous’ SAR-based estimates of 
species “committed to extinction’ from climate change (18-35% by 
2050) are too high. However, most published estimates of extinction 
risk from climate change do not derive from SAR’. For example, it has 
been estimated’ that “5%, 8% and 16% (mean of dispersal scenarios) of 
the species considered would have lost 100% of their climatically suitable 
area by 2050, for minimum, mid-range and maximum climate warming, 
respectively” and that “15%, 22% and 40%... are projected to have lost 
more than 90%... by 2050.” Given the near-linear continuation of global 
warming projected before and after 2050, most species losing >90% of 
their climatically suitable areas over the period ~ 1970-2050 (and many 
additional species losing 70-90%) would lose 100% of their area long 
before 2100. With time lags in both human and climate systems, at least 
15-40% of the species analysed are effectively committed to extinction 
by 2050. 

He and Hubbell’ also argue that projected extinctions exceed those 
observed, but high population-level extinction rates have already been 
observed: ~20% climate-related losses within 500 km of retreating 
latitudinal boundaries’, 34% loss of populated areas at retreating 
elevation boundaries®, and loss of an estimated 4% of worldwide lizard 
populations, consistent with 20% loss of lizard species by 2080°. Cloud 
forest moth species on Mount Kinabalu in Borneo have contracted at 
both lower and upper boundaries”’ at a rate that, if sustained, would 
extinguish ~45% of the endemic species by 2100. Amphibians and 
reptiles have shifted higher in Tsaratanana Massif in Madagascar, 
where three (5.9% of 51 species considered) of the highest elevation 
species were not found in 2003". At Monteverde in Costa Rica, two 
high elevation anole lizard species became extinct from the study area, 
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and two high elevation frog/toad species became globally extinct, after 
dry years’*. The pathogen-induced extinction of ~2.2% of New 
World amphibian species (harlequin frogs) coincided with unusually 
hot years’’. A third of the world’s coral species are threatened by a 
combination of temperature-induced bleaching, ocean acidification 
and other pressures”. 

Anthropogenic warming so far is less than or equal to half of that 
expected by 2050, and modelled biodiversity losses accelerate with 
increased warming. Recently observed range shifts have tracked levels 
of climate change’’, and these empirical trends are concordant with 
projected 2050/2100 losses. Although many uncertainties remain, we 
believe that He and Hubbell’s conclusions about extinction risks are 
unjustified. 
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REPLYING TO H. M. Pereira, L. Borda-de-Agua &\|. Santos Martins Nature 482, doi.10.1038/nature10857; C.D. Thomas & M. Williamson Nature 482, 


doi10.1038/nature10858 


Pereira et al.' argue that our conclusion’ that species—area relationships 
(SARs) always overestimate extinction is not general because the spatial 
configuration of landscape destruction can influence the results. 


Thomas and Williamson* argue that there are many other causes of 
extinction besides habitat loss. We agree with the latter comment, but 
show that the arguments of Pereira et al. are not substantiated. 
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Conservation biologists make wide use of SARs to estimate species 
extinction caused by habitat loss. The mathematics underpinning this 
application is? 


EAR(a) =SAR(A) —SAR(A — a) (1) 


where EAR(a) is the number of species endemic to subarea a that is 
nested within the regional area A, SAR(A) is the total number of 
species in the region, and SAR(A — a) is the number of species in 
the complementary area A — a. 

EAR(a) is the number of species immediately lost if habitat area a is 
destroyed. EAR(a) is usually not known because data on the global 
distribution of species are not available. Traditionally, EAR(a) is 
obtained by substituting a SAR model, usually the power-law SAR 
model, into equation (1). However, by making this substitution, our 
paper’ shows that one inevitably overestimates the average, or 
expected, extinction rate. This so-called backward SAR method is a 
method for estimating endemic species, not ‘extinction debt’. The 
backward SAR method has nothing to do with, and does not measure, 
extinction debt. We do not question the existence of extinction debt, 
but to measure extinction debt it is necessary to use other methods. 

There are four reasons that the arguments of Pereira et al. are not 
substantiated. First, Pereira et al.! commit a statistical error by confus- 
ing a specific configuration of landscape destruction with the statistical 
expectation. The SAR is a macroecological pattern defined as the 
expected number of species as a function of area. The word ‘always’ 
in the title of our paper’ refers to the fact that the expectation of 
extinction rate is always biased too high if one uses the backward 
power-law SAR method. One certainly cannot trust any single specific 
case of the extinction rate estimated in this manner to be reliable, and 
our result is a general proof that shows that the average extinction rate 
so estimated is always an overestimate. 

Second, if what Pereira et al. say is correct, then the outward EAR and 
the inward EAR must be different, but they are not different in their own 
analysis of the Yasuni plot (figure 2c in ref. 1), undermining their claim. 
The configuration of destruction can matter only to specific samples, 
but does not eliminate the bias we show exists in the statistical expecta- 
tion. It is unclear why outward-inward destruction should be so special, 
versus, for example, left-to-right or up-to-down destruction. Clearly, a 
specific destruction pattern cannot represent the general expectation 
because it is just one sample of many possible patterns of destruction. 

Third, Pereira et al. compare the SARs between the inward and 
outward configurations for the Barro Colorado Island (BCI) and 
Yasuni plots and argue that the inward EAR can be predicted by 
the backward SAR for the two plots because the Zs,p values for both 
configurations are similar. However, a close scrutiny of the method 
used to calculate these z values shows that the result is the outcome of 
post-hoc selected ranges of area over which they chose to fit the SARs. 
For the BCI plot, they used 1 ha as the minimal area, but they used a 
5 ha minimum for the Yasuni plot (figure 2b, d in ref. 1). The problem 
is that one can obtain practically any z value by arbitrarily varying the 
minimal area. This is because in small areas the SAR is only approxi- 
mately a power law, and including small areas when fitting the power- 
law SAR model inflates the z value. This arbitrary post-hoc selection 
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of z values invalidates their comparison. The minimum area in our 
study’ is consistently set to be =0.2 ha across all plots to ensure that 
the analyses are standardized and comparable and that the log-log 
SARs are adequately linear with R* > 0.92. Using a consistent minimum 
area, one does not obtain their result. 

Fourth, Pereira et al. argue that island SARs are more appropriate 
models for estimating extinction rates. This is not correct. Regardless 
of what you call the SAR or the reason why island SARs generally have 
steeper slopes than continental SARs, people use the same backwards 
SAR model to estimate extinction rates on continents and in island 
archipelagoes. In instances in which z values are not available, 
researchers universally use z = 0.25 (refs 4, 5). 

We do not disagree with Thomas and Williamson? that extinction 
is caused by many factors, not just habitat loss, including climate 
change, and we also agree that extinction is real and happening at 
elevated rates. All we have shown is that the backward SAR method is 
not appropriate for estimating extinction rates caused by habitat loss. 
Any extinction rates estimated from that method are questionable. 
Weare well aware that species extinction can be evaluated by a variety 
of methods. Not all of the extinction estimates in the Millennium 
Ecosystem Assessment used the flawed backwards power-law 
method. We did not question or assess the validity of those methods 
because our study does not apply to them. We also did not criticize the 
methods used by the Intergovernmental Panel on Climate Change or 
the International Union for Conservation of Nature to estimate 
extinctions, contrary to misquotes in the press. 

For further information, a JAVA program written by G. Acevo that 
computes SAR and EAR curves and expectations for model com- 
munities is available for download from http://shubbell.eeb.ucla.edu/ 
earsar.php. 
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Geometry and scale in species-area 


relationships 


ARISING FROM F. He & S. P. Hubbell Nature 473, 368-371 (2011) 


He and Hubbell developed a sampling theory for the species—area 
relationship (SAR) and the endemics-area relationship (EAR)'. 
They argued that the number of extinctions after habitat loss is 


Figure 1 | The outward EAR and the inward EAR. a, The outward EAR is 
calculated by counting the number of the endemic species to a rectangle from 
the centre to the periphery. b, The inward EAR is calculated by counting the 
number of endemics to an outer ring from the periphery to the centre. It is the 
inward EAR that replicates the geometry of the backward SAR. 
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described by the EAR and that extinction rates in previous studies 
are overestimates because the EAR is always lower than the SAR. Here 
we show that their conclusion is not general and depends on the 
geometry of habitat destruction and the scale of the SAR. We also 
question their critique of the Millennium Ecosystem Assessment 
estimates, as those estimates are not dependent on the SAR only, 
although important uncertainties remain due to other methodological 
issues. 

In several studies of extinction rates**, the proportion of extinc- 
tions after a habitat loss of area a from a total area A has been esti- 
mated from the power-law model of the SAR, Ssar(A) = cA’, as: 


Vata Ssar(A) —Ssar(A—a) = (1 “ (1) 


Ssar(A) A 


He and Hubbell call this method the backward SAR, as it uses esti- 
mates from the SAR in a backward way (from large to small areas) of 
how the SAR is constructed’. They argue that, instead, the number of 
extinctions is given by the proportion of endemics in a relative to A, 
which can be approximated by 


Figure 2 | The influence of scale and geometry on 
the EAR and the SAR. a, c, The graphs compare 
the outward EAR and the inward EAR with the 
backward SAR model (Asap) and the forward EAR 
model (JR) fitted to the data of each plot. Points 
correspond to the value of the EAR for each area 
size, sampled as in Fig. 1. The z value for Jzar 
comes from He and Hubbell’, whereas the z value 
for the As,r comes from the fit of the power law to 
the linear region of the SAR. b, d, Fit of the power- 
law SAR (Sgag) to the data on a log-log scale. Each 
point corresponds to the average number of species 
for randomly placed rectangles with a given area 
size. The SAR sampled from the centre to the 
periphery (Fig. 1a) gives similar z values 

(Zgcr = 0.1265 and Zyasuni = 0.0625). All z values 
were obtained by nonlinear least squares. The 
dashed vertical line marks the minimum area 
included in the fit. The top plots are for the tree and 
shrub species in the 50 ha plot in Barro Colorado 
Island (BCI), Panama, whereas the bottom plots 
are for the 50 ha plot in Yasuni, Ecuador. 
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VYear(4) = 1— (1 = “) : (2) 


where z’ is fit from the EAR and is always lower than the z from the 
SAR. The EAR is built in a forward fashion, counting the endemic 
species in progressively larger areas. 

It is uncontroversial that the species that go extinct immediately 
after habitat destruction are the endemic species to the area 
removed’®. However, both Agar and Asap describe the proportion 
of endemics in an area a, although of different geometry”. If destruc- 
tion starts from the centre of the patch (Fig. 1a), then Agar describes 
the number of extinctions because it approximates the proportion of 
endemics in progressively larger rectangles, the outward EAR 
(Fig. 2a). In contrast, if destruction occurs in the periphery 
(Fig. 1b), it is Asap that describes extinctions because it approximates 
the number of endemics in outer rings towards the centre of the plot, 
the inward EAR (Fig. 2a). This happens because only the inward EAR 
backtracks the geometry of how the SAR is built. This backtracking is 
exact if the SAR is built as in Fig. 1a, or approximate if the SAR is built 
from sampling several rectangles for each area size, but the z values of 
both methods are almost the same (Fig. 2). 

Note that, depending on the spatial structure of the distribution of 
species in the plot, the outward EAR may be similar to the inward EAR 
(Fig. 2c), but Asar is always a good approximation of the inward EAR 
as long as the SAR data points fit the power law. This fit depends on 
the scale of the SAR. Several studies have shown that at very small 
scales the SAR is curvilinear in a log-log scale’, as can be observed in 
the Barro Colorado Island and Yasuni plots (Fig. 2b, d). Therefore, the 
z of the SAR must be calculated for the linear region that is relevant for 
the extinction projections. 

There are many other sources of uncertainties in estimating future 
extinction rates. For instance, both the SAR and EAR project that all 
species go extinct after all native habitat is lost, ignoring that many 
species persist in human-modified habitats. The countryside SAR 
addresses this problem by tracking the number of species with 
similar habitat affinities in multiple habitats*, Another open 
question is what type of SAR better describes long-term extinctions 
after habitat loss. After a first stage of extinction of endemics, 
described by the EAR or the backward SAR, many species that still 
occur in the landscape will go extinct because the habitat left for 
them is smaller then their minimum required habitat size®. In this 
case, it has been proposed that future extinction rates are better 
described by the island SAR (built from counting the number of 
species in different islands)’. 

The Millennium Ecosystem Assessment drew in a wide range of 
extinction projections to identify the envelope of those uncertainties’. 
The SAR projections** were consistent with estimates from other 
methods, such as assessing the extinction risk of currently threatened 


species'®"'. In 2010 there was a revised assessment with more recent 
global extinction projections”, in which SAR-based projections again 
had a limited role, and new approaches such as the overlap of species 
ranges with habitat loss'’, ecophysiological models" and the correla- 
tion between elevational range and extinction risk, were included’. 
The range of uncertainty across models and scenarios was close to 
three orders of magnitude, compared to which the uncertainty now 
identified by He and Hubbell’ is negligible. In all cases models and 
scenarios supported the Millennium Ecosystem Assessment conclu- 
sions that biodiversity will continue to decline, and in most cases at 
increasing rates relatively to the recent past. 
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Extinction and climate change 


ARISING FROM F. He & S. P. Hubbell Nature 473, 368-371 (2011) 


Statistical relationships between habitat area and the number of 
species observed (species—area relationships, SARs) are sometimes 
used to assess extinction risks following habitat destruction or loss 
of climatic suitability. He and Hubbell’ argue that the numbers of 
species confined to—rather than observed in—different areas 
(endemics-—area relationships, EARs) should be used instead of 
SARs, and that SAR-based extinction estimates in the literature 
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are too high. We suggest that He and Hubbell’s SAR estimates are 
biased, that the empirical data they use are not appropriate to 
calculate extinction risks, and that their statements about extinction 
risks from climate change? do not take into account non-SAR-based 
estimates or recent observations. Species have already responded to 
climate change in a manner consistent with high future extinction 
risks. 
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Most of He and Hubbell’s results involved analysis of the number of 
tree species in 0.2ha and successively larger subplots within forest 
stands of 20-50 ha. By only counting the tree stems present in a plot 
(rather than canopies), they underestimate the true number of species 
present in small subplots. This artefact exaggerates SAR slopes when 
subplots smaller than ~2.5 ha are included’. 

We suggest that the data He and Hubbell’ use are not appropriate 
to calculate SAR or EAR slopes that are relevant to extinction. To 
calculate extinction risks, it is necessary to consider how many 
species might be lost if a habitat becomes isolated; however, He 
and Hubbell used data for forest plots that are surrounded by more 
forest, and for bird distributional cells that are surrounded by 
other land where birds also live. He and Hubbell’ consider the 
instantaneous presence of species in sample plots within contiguous 
areas, not the expected long-term persistence of species if these 
habitats were isolated. On average, 31 species of birds bred each year 
in Eastern Wood in England (instantaneous number), but only 16 
species bred in every one of 25 years (persistent species)*. Were this 
woodland completely isolated from other breeding habitats, the 
number of species would about halve in 25 years, resulting in much 
steeper SAR slopes. It is not known whether SAR and EAR estimates 
would steepen equally or converge for true isolates, so He and 
Hubbell’s' main conclusion that SARs overestimate extinction 
remains unsubstantiated. 

He and Hubbell’ consider that previous’ SAR-based estimates of 
species “committed to extinction’ from climate change (18-35% by 
2050) are too high. However, most published estimates of extinction 
risk from climate change do not derive from SAR’. For example, it has 
been estimated’ that “5%, 8% and 16% (mean of dispersal scenarios) of 
the species considered would have lost 100% of their climatically suitable 
area by 2050, for minimum, mid-range and maximum climate warming, 
respectively” and that “15%, 22% and 40%... are projected to have lost 
more than 90%... by 2050.” Given the near-linear continuation of global 
warming projected before and after 2050, most species losing >90% of 
their climatically suitable areas over the period ~ 1970-2050 (and many 
additional species losing 70-90%) would lose 100% of their area long 
before 2100. With time lags in both human and climate systems, at least 
15-40% of the species analysed are effectively committed to extinction 
by 2050. 

He and Hubbell’ also argue that projected extinctions exceed those 
observed, but high population-level extinction rates have already been 
observed: ~20% climate-related losses within 500 km of retreating 
latitudinal boundaries’, 34% loss of populated areas at retreating 
elevation boundaries®, and loss of an estimated 4% of worldwide lizard 
populations, consistent with 20% loss of lizard species by 2080°. Cloud 
forest moth species on Mount Kinabalu in Borneo have contracted at 
both lower and upper boundaries”’ at a rate that, if sustained, would 
extinguish ~45% of the endemic species by 2100. Amphibians and 
reptiles have shifted higher in Tsaratanana Massif in Madagascar, 
where three (5.9% of 51 species considered) of the highest elevation 
species were not found in 2003". At Monteverde in Costa Rica, two 
high elevation anole lizard species became extinct from the study area, 


He and Hubbell reply 


and two high elevation frog/toad species became globally extinct, after 
dry years’*. The pathogen-induced extinction of ~2.2% of New 
World amphibian species (harlequin frogs) coincided with unusually 
hot years’’. A third of the world’s coral species are threatened by a 
combination of temperature-induced bleaching, ocean acidification 
and other pressures”. 

Anthropogenic warming so far is less than or equal to half of that 
expected by 2050, and modelled biodiversity losses accelerate with 
increased warming. Recently observed range shifts have tracked levels 
of climate change’’, and these empirical trends are concordant with 
projected 2050/2100 losses. Although many uncertainties remain, we 
believe that He and Hubbell’s conclusions about extinction risks are 
unjustified. 
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REPLYING TO H. M. Pereira, L. Borda-de-Agua &\|. Santos Martins Nature 482, doi.10.1038/nature10857; C.D. Thomas & M. Williamson Nature 482, 


doi10.1038/nature10858 


Pereira et al.' argue that our conclusion’ that species—area relationships 
(SARs) always overestimate extinction is not general because the spatial 
configuration of landscape destruction can influence the results. 


Thomas and Williamson* argue that there are many other causes of 
extinction besides habitat loss. We agree with the latter comment, but 
show that the arguments of Pereira et al. are not substantiated. 
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Conservation biologists make wide use of SARs to estimate species 
extinction caused by habitat loss. The mathematics underpinning this 
application is? 


EAR(a) =SAR(A) —SAR(A — a) (1) 


where EAR(a) is the number of species endemic to subarea a that is 
nested within the regional area A, SAR(A) is the total number of 
species in the region, and SAR(A — a) is the number of species in 
the complementary area A — a. 

EAR(a) is the number of species immediately lost if habitat area a is 
destroyed. EAR(a) is usually not known because data on the global 
distribution of species are not available. Traditionally, EAR(a) is 
obtained by substituting a SAR model, usually the power-law SAR 
model, into equation (1). However, by making this substitution, our 
paper’ shows that one inevitably overestimates the average, or 
expected, extinction rate. This so-called backward SAR method is a 
method for estimating endemic species, not ‘extinction debt’. The 
backward SAR method has nothing to do with, and does not measure, 
extinction debt. We do not question the existence of extinction debt, 
but to measure extinction debt it is necessary to use other methods. 

There are four reasons that the arguments of Pereira et al. are not 
substantiated. First, Pereira et al.! commit a statistical error by confus- 
ing a specific configuration of landscape destruction with the statistical 
expectation. The SAR is a macroecological pattern defined as the 
expected number of species as a function of area. The word ‘always’ 
in the title of our paper’ refers to the fact that the expectation of 
extinction rate is always biased too high if one uses the backward 
power-law SAR method. One certainly cannot trust any single specific 
case of the extinction rate estimated in this manner to be reliable, and 
our result is a general proof that shows that the average extinction rate 
so estimated is always an overestimate. 

Second, if what Pereira et al. say is correct, then the outward EAR and 
the inward EAR must be different, but they are not different in their own 
analysis of the Yasuni plot (figure 2c in ref. 1), undermining their claim. 
The configuration of destruction can matter only to specific samples, 
but does not eliminate the bias we show exists in the statistical expecta- 
tion. It is unclear why outward-inward destruction should be so special, 
versus, for example, left-to-right or up-to-down destruction. Clearly, a 
specific destruction pattern cannot represent the general expectation 
because it is just one sample of many possible patterns of destruction. 

Third, Pereira et al. compare the SARs between the inward and 
outward configurations for the Barro Colorado Island (BCI) and 
Yasuni plots and argue that the inward EAR can be predicted by 
the backward SAR for the two plots because the Zs,p values for both 
configurations are similar. However, a close scrutiny of the method 
used to calculate these z values shows that the result is the outcome of 
post-hoc selected ranges of area over which they chose to fit the SARs. 
For the BCI plot, they used 1 ha as the minimal area, but they used a 
5 ha minimum for the Yasuni plot (figure 2b, d in ref. 1). The problem 
is that one can obtain practically any z value by arbitrarily varying the 
minimal area. This is because in small areas the SAR is only approxi- 
mately a power law, and including small areas when fitting the power- 
law SAR model inflates the z value. This arbitrary post-hoc selection 
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of z values invalidates their comparison. The minimum area in our 
study’ is consistently set to be =0.2 ha across all plots to ensure that 
the analyses are standardized and comparable and that the log-log 
SARs are adequately linear with R* > 0.92. Using a consistent minimum 
area, one does not obtain their result. 

Fourth, Pereira et al. argue that island SARs are more appropriate 
models for estimating extinction rates. This is not correct. Regardless 
of what you call the SAR or the reason why island SARs generally have 
steeper slopes than continental SARs, people use the same backwards 
SAR model to estimate extinction rates on continents and in island 
archipelagoes. In instances in which z values are not available, 
researchers universally use z = 0.25 (refs 4, 5). 

We do not disagree with Thomas and Williamson? that extinction 
is caused by many factors, not just habitat loss, including climate 
change, and we also agree that extinction is real and happening at 
elevated rates. All we have shown is that the backward SAR method is 
not appropriate for estimating extinction rates caused by habitat loss. 
Any extinction rates estimated from that method are questionable. 
Weare well aware that species extinction can be evaluated by a variety 
of methods. Not all of the extinction estimates in the Millennium 
Ecosystem Assessment used the flawed backwards power-law 
method. We did not question or assess the validity of those methods 
because our study does not apply to them. We also did not criticize the 
methods used by the Intergovernmental Panel on Climate Change or 
the International Union for Conservation of Nature to estimate 
extinctions, contrary to misquotes in the press. 

For further information, a JAVA program written by G. Acevo that 
computes SAR and EAR curves and expectations for model com- 
munities is available for download from http://shubbell.eeb.ucla.edu/ 
earsar.php. 
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Geometry and scale in species-area 


relationships 


ARISING FROM F. He & S. P. Hubbell Nature 473, 368-371 (2011) 


He and Hubbell developed a sampling theory for the species—area 
relationship (SAR) and the endemics-area relationship (EAR)'. 
They argued that the number of extinctions after habitat loss is 


Figure 1 | The outward EAR and the inward EAR. a, The outward EAR is 
calculated by counting the number of the endemic species to a rectangle from 
the centre to the periphery. b, The inward EAR is calculated by counting the 
number of endemics to an outer ring from the periphery to the centre. It is the 
inward EAR that replicates the geometry of the backward SAR. 
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described by the EAR and that extinction rates in previous studies 
are overestimates because the EAR is always lower than the SAR. Here 
we show that their conclusion is not general and depends on the 
geometry of habitat destruction and the scale of the SAR. We also 
question their critique of the Millennium Ecosystem Assessment 
estimates, as those estimates are not dependent on the SAR only, 
although important uncertainties remain due to other methodological 
issues. 

In several studies of extinction rates**, the proportion of extinc- 
tions after a habitat loss of area a from a total area A has been esti- 
mated from the power-law model of the SAR, Ssar(A) = cA’, as: 


Vata Ssar(A) —Ssar(A—a) = (1 “ (1) 


Ssar(A) A 


He and Hubbell call this method the backward SAR, as it uses esti- 
mates from the SAR in a backward way (from large to small areas) of 
how the SAR is constructed’. They argue that, instead, the number of 
extinctions is given by the proportion of endemics in a relative to A, 
which can be approximated by 


Figure 2 | The influence of scale and geometry on 
the EAR and the SAR. a, c, The graphs compare 
the outward EAR and the inward EAR with the 
backward SAR model (Asap) and the forward EAR 
model (JR) fitted to the data of each plot. Points 
correspond to the value of the EAR for each area 
size, sampled as in Fig. 1. The z value for Jzar 
comes from He and Hubbell’, whereas the z value 
for the As,r comes from the fit of the power law to 
the linear region of the SAR. b, d, Fit of the power- 
law SAR (Sgag) to the data on a log-log scale. Each 
point corresponds to the average number of species 
for randomly placed rectangles with a given area 
size. The SAR sampled from the centre to the 
periphery (Fig. 1a) gives similar z values 

(Zgcr = 0.1265 and Zyasuni = 0.0625). All z values 
were obtained by nonlinear least squares. The 
dashed vertical line marks the minimum area 
included in the fit. The top plots are for the tree and 
shrub species in the 50 ha plot in Barro Colorado 
Island (BCI), Panama, whereas the bottom plots 
are for the 50 ha plot in Yasuni, Ecuador. 
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VYear(4) = 1— (1 = “) : (2) 


where z’ is fit from the EAR and is always lower than the z from the 
SAR. The EAR is built in a forward fashion, counting the endemic 
species in progressively larger areas. 

It is uncontroversial that the species that go extinct immediately 
after habitat destruction are the endemic species to the area 
removed’®. However, both Agar and Asap describe the proportion 
of endemics in an area a, although of different geometry”. If destruc- 
tion starts from the centre of the patch (Fig. 1a), then Agar describes 
the number of extinctions because it approximates the proportion of 
endemics in progressively larger rectangles, the outward EAR 
(Fig. 2a). In contrast, if destruction occurs in the periphery 
(Fig. 1b), it is Asap that describes extinctions because it approximates 
the number of endemics in outer rings towards the centre of the plot, 
the inward EAR (Fig. 2a). This happens because only the inward EAR 
backtracks the geometry of how the SAR is built. This backtracking is 
exact if the SAR is built as in Fig. 1a, or approximate if the SAR is built 
from sampling several rectangles for each area size, but the z values of 
both methods are almost the same (Fig. 2). 

Note that, depending on the spatial structure of the distribution of 
species in the plot, the outward EAR may be similar to the inward EAR 
(Fig. 2c), but Asar is always a good approximation of the inward EAR 
as long as the SAR data points fit the power law. This fit depends on 
the scale of the SAR. Several studies have shown that at very small 
scales the SAR is curvilinear in a log-log scale’, as can be observed in 
the Barro Colorado Island and Yasuni plots (Fig. 2b, d). Therefore, the 
z of the SAR must be calculated for the linear region that is relevant for 
the extinction projections. 

There are many other sources of uncertainties in estimating future 
extinction rates. For instance, both the SAR and EAR project that all 
species go extinct after all native habitat is lost, ignoring that many 
species persist in human-modified habitats. The countryside SAR 
addresses this problem by tracking the number of species with 
similar habitat affinities in multiple habitats*, Another open 
question is what type of SAR better describes long-term extinctions 
after habitat loss. After a first stage of extinction of endemics, 
described by the EAR or the backward SAR, many species that still 
occur in the landscape will go extinct because the habitat left for 
them is smaller then their minimum required habitat size®. In this 
case, it has been proposed that future extinction rates are better 
described by the island SAR (built from counting the number of 
species in different islands)’. 

The Millennium Ecosystem Assessment drew in a wide range of 
extinction projections to identify the envelope of those uncertainties’. 
The SAR projections** were consistent with estimates from other 
methods, such as assessing the extinction risk of currently threatened 


species'®"'. In 2010 there was a revised assessment with more recent 
global extinction projections”, in which SAR-based projections again 
had a limited role, and new approaches such as the overlap of species 
ranges with habitat loss'’, ecophysiological models" and the correla- 
tion between elevational range and extinction risk, were included’. 
The range of uncertainty across models and scenarios was close to 
three orders of magnitude, compared to which the uncertainty now 
identified by He and Hubbell’ is negligible. In all cases models and 
scenarios supported the Millennium Ecosystem Assessment conclu- 
sions that biodiversity will continue to decline, and in most cases at 
increasing rates relatively to the recent past. 
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Extinction and climate change 


ARISING FROM F. He & S. P. Hubbell Nature 473, 368-371 (2011) 


Statistical relationships between habitat area and the number of 
species observed (species—area relationships, SARs) are sometimes 
used to assess extinction risks following habitat destruction or loss 
of climatic suitability. He and Hubbell’ argue that the numbers of 
species confined to—rather than observed in—different areas 
(endemics-—area relationships, EARs) should be used instead of 
SARs, and that SAR-based extinction estimates in the literature 
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are too high. We suggest that He and Hubbell’s SAR estimates are 
biased, that the empirical data they use are not appropriate to 
calculate extinction risks, and that their statements about extinction 
risks from climate change? do not take into account non-SAR-based 
estimates or recent observations. Species have already responded to 
climate change in a manner consistent with high future extinction 
risks. 
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Most of He and Hubbell’s results involved analysis of the number of 
tree species in 0.2ha and successively larger subplots within forest 
stands of 20-50 ha. By only counting the tree stems present in a plot 
(rather than canopies), they underestimate the true number of species 
present in small subplots. This artefact exaggerates SAR slopes when 
subplots smaller than ~2.5 ha are included’. 

We suggest that the data He and Hubbell’ use are not appropriate 
to calculate SAR or EAR slopes that are relevant to extinction. To 
calculate extinction risks, it is necessary to consider how many 
species might be lost if a habitat becomes isolated; however, He 
and Hubbell used data for forest plots that are surrounded by more 
forest, and for bird distributional cells that are surrounded by 
other land where birds also live. He and Hubbell’ consider the 
instantaneous presence of species in sample plots within contiguous 
areas, not the expected long-term persistence of species if these 
habitats were isolated. On average, 31 species of birds bred each year 
in Eastern Wood in England (instantaneous number), but only 16 
species bred in every one of 25 years (persistent species)*. Were this 
woodland completely isolated from other breeding habitats, the 
number of species would about halve in 25 years, resulting in much 
steeper SAR slopes. It is not known whether SAR and EAR estimates 
would steepen equally or converge for true isolates, so He and 
Hubbell’s' main conclusion that SARs overestimate extinction 
remains unsubstantiated. 

He and Hubbell’ consider that previous’ SAR-based estimates of 
species “committed to extinction’ from climate change (18-35% by 
2050) are too high. However, most published estimates of extinction 
risk from climate change do not derive from SAR’. For example, it has 
been estimated’ that “5%, 8% and 16% (mean of dispersal scenarios) of 
the species considered would have lost 100% of their climatically suitable 
area by 2050, for minimum, mid-range and maximum climate warming, 
respectively” and that “15%, 22% and 40%... are projected to have lost 
more than 90%... by 2050.” Given the near-linear continuation of global 
warming projected before and after 2050, most species losing >90% of 
their climatically suitable areas over the period ~ 1970-2050 (and many 
additional species losing 70-90%) would lose 100% of their area long 
before 2100. With time lags in both human and climate systems, at least 
15-40% of the species analysed are effectively committed to extinction 
by 2050. 

He and Hubbell’ also argue that projected extinctions exceed those 
observed, but high population-level extinction rates have already been 
observed: ~20% climate-related losses within 500 km of retreating 
latitudinal boundaries’, 34% loss of populated areas at retreating 
elevation boundaries®, and loss of an estimated 4% of worldwide lizard 
populations, consistent with 20% loss of lizard species by 2080°. Cloud 
forest moth species on Mount Kinabalu in Borneo have contracted at 
both lower and upper boundaries”’ at a rate that, if sustained, would 
extinguish ~45% of the endemic species by 2100. Amphibians and 
reptiles have shifted higher in Tsaratanana Massif in Madagascar, 
where three (5.9% of 51 species considered) of the highest elevation 
species were not found in 2003". At Monteverde in Costa Rica, two 
high elevation anole lizard species became extinct from the study area, 


He and Hubbell reply 


and two high elevation frog/toad species became globally extinct, after 
dry years’*. The pathogen-induced extinction of ~2.2% of New 
World amphibian species (harlequin frogs) coincided with unusually 
hot years’’. A third of the world’s coral species are threatened by a 
combination of temperature-induced bleaching, ocean acidification 
and other pressures”. 

Anthropogenic warming so far is less than or equal to half of that 
expected by 2050, and modelled biodiversity losses accelerate with 
increased warming. Recently observed range shifts have tracked levels 
of climate change’’, and these empirical trends are concordant with 
projected 2050/2100 losses. Although many uncertainties remain, we 
believe that He and Hubbell’s conclusions about extinction risks are 
unjustified. 
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REPLYING TO H. M. Pereira, L. Borda-de-Agua &\|. Santos Martins Nature 482, doi.10.1038/nature10857; C.D. Thomas & M. Williamson Nature 482, 


doi10.1038/nature10858 


Pereira et al.' argue that our conclusion’ that species—area relationships 
(SARs) always overestimate extinction is not general because the spatial 
configuration of landscape destruction can influence the results. 


Thomas and Williamson* argue that there are many other causes of 
extinction besides habitat loss. We agree with the latter comment, but 
show that the arguments of Pereira et al. are not substantiated. 
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Conservation biologists make wide use of SARs to estimate species 
extinction caused by habitat loss. The mathematics underpinning this 
application is? 


EAR(a) =SAR(A) —SAR(A — a) (1) 


where EAR(a) is the number of species endemic to subarea a that is 
nested within the regional area A, SAR(A) is the total number of 
species in the region, and SAR(A — a) is the number of species in 
the complementary area A — a. 

EAR(a) is the number of species immediately lost if habitat area a is 
destroyed. EAR(a) is usually not known because data on the global 
distribution of species are not available. Traditionally, EAR(a) is 
obtained by substituting a SAR model, usually the power-law SAR 
model, into equation (1). However, by making this substitution, our 
paper’ shows that one inevitably overestimates the average, or 
expected, extinction rate. This so-called backward SAR method is a 
method for estimating endemic species, not ‘extinction debt’. The 
backward SAR method has nothing to do with, and does not measure, 
extinction debt. We do not question the existence of extinction debt, 
but to measure extinction debt it is necessary to use other methods. 

There are four reasons that the arguments of Pereira et al. are not 
substantiated. First, Pereira et al.! commit a statistical error by confus- 
ing a specific configuration of landscape destruction with the statistical 
expectation. The SAR is a macroecological pattern defined as the 
expected number of species as a function of area. The word ‘always’ 
in the title of our paper’ refers to the fact that the expectation of 
extinction rate is always biased too high if one uses the backward 
power-law SAR method. One certainly cannot trust any single specific 
case of the extinction rate estimated in this manner to be reliable, and 
our result is a general proof that shows that the average extinction rate 
so estimated is always an overestimate. 

Second, if what Pereira et al. say is correct, then the outward EAR and 
the inward EAR must be different, but they are not different in their own 
analysis of the Yasuni plot (figure 2c in ref. 1), undermining their claim. 
The configuration of destruction can matter only to specific samples, 
but does not eliminate the bias we show exists in the statistical expecta- 
tion. It is unclear why outward-inward destruction should be so special, 
versus, for example, left-to-right or up-to-down destruction. Clearly, a 
specific destruction pattern cannot represent the general expectation 
because it is just one sample of many possible patterns of destruction. 

Third, Pereira et al. compare the SARs between the inward and 
outward configurations for the Barro Colorado Island (BCI) and 
Yasuni plots and argue that the inward EAR can be predicted by 
the backward SAR for the two plots because the Zs,p values for both 
configurations are similar. However, a close scrutiny of the method 
used to calculate these z values shows that the result is the outcome of 
post-hoc selected ranges of area over which they chose to fit the SARs. 
For the BCI plot, they used 1 ha as the minimal area, but they used a 
5 ha minimum for the Yasuni plot (figure 2b, d in ref. 1). The problem 
is that one can obtain practically any z value by arbitrarily varying the 
minimal area. This is because in small areas the SAR is only approxi- 
mately a power law, and including small areas when fitting the power- 
law SAR model inflates the z value. This arbitrary post-hoc selection 
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of z values invalidates their comparison. The minimum area in our 
study’ is consistently set to be =0.2 ha across all plots to ensure that 
the analyses are standardized and comparable and that the log-log 
SARs are adequately linear with R* > 0.92. Using a consistent minimum 
area, one does not obtain their result. 

Fourth, Pereira et al. argue that island SARs are more appropriate 
models for estimating extinction rates. This is not correct. Regardless 
of what you call the SAR or the reason why island SARs generally have 
steeper slopes than continental SARs, people use the same backwards 
SAR model to estimate extinction rates on continents and in island 
archipelagoes. In instances in which z values are not available, 
researchers universally use z = 0.25 (refs 4, 5). 

We do not disagree with Thomas and Williamson? that extinction 
is caused by many factors, not just habitat loss, including climate 
change, and we also agree that extinction is real and happening at 
elevated rates. All we have shown is that the backward SAR method is 
not appropriate for estimating extinction rates caused by habitat loss. 
Any extinction rates estimated from that method are questionable. 
Weare well aware that species extinction can be evaluated by a variety 
of methods. Not all of the extinction estimates in the Millennium 
Ecosystem Assessment used the flawed backwards power-law 
method. We did not question or assess the validity of those methods 
because our study does not apply to them. We also did not criticize the 
methods used by the Intergovernmental Panel on Climate Change or 
the International Union for Conservation of Nature to estimate 
extinctions, contrary to misquotes in the press. 

For further information, a JAVA program written by G. Acevo that 
computes SAR and EAR curves and expectations for model com- 
munities is available for download from http://shubbell.eeb.ucla.edu/ 
earsar.php. 
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Opposite effects of fear conditioning and extinction 
on dendritic spine remodelling 


Cora Sau Wan Lai!, Thomas F. Franke? & Wen-Biao Gan! 


It is generally believed that fear extinction is a form of new learning 
that inhibits rather than erases previously acquired fear memories’ *. 
Although this view has gained much support from behavioural and 
electrophysiological studies''°, the hypothesis that extinction causes 
the partial erasure of fear memories remains viable. Using trans- 
cranial two-photon microscopy'’”’, we investigated how neural 
circuits are modified by fear learning and extinction by examining 
the formation and elimination of postsynaptic dendritic spines of 
layer-V pyramidal neurons in the mouse frontal association cortex. 
Here we show that fear conditioning by pairing an auditory cue with 
a footshock increases the rate of spine elimination. By contrast, fear 
extinction by repeated presentation of the same auditory cue without 
a footshock increases the rate of spine formation. The degrees of 
spine remodelling induced by fear conditioning and extinction 
strongly correlate with the expression and extinction of conditioned 
fear responses, respectively. Notably, spine elimination and forma- 
tion induced by fear conditioning and extinction occur on the same 
dendritic branches in a cue- and location-specific manner: cue- 
specific extinction causes formation of dendritic spines within a 
distance of two micrometres from spines that were eliminated 
after fear conditioning. Furthermore, reconditioning preferentially 
induces elimination of dendritic spines that were formed after extinc- 
tion. Thus, within vastly complex neuronal networks, fear condition- 
ing, extinction and reconditioning lead to opposing changes at the 
level of individual synapses. These findings also suggest that fear 
memory traces are partially erased after extinction. 

Classical fear conditioning is widely used to study associative learn- 
ing in which a conditioned neutral stimulus (CS; for example an 
auditory cue) is paired with the presentation of an unconditioned 
aversive stimulus (US; for example a footshock) to elicit a conditioned 
response’? (CR; for example a freezing response to CS in the absence of 
US). Repeated exposures to CS diminish the expression of the CR, a 
process called extinction’ ’. It is widely believed that fear extinction 
involves new learning of the ‘safe’ association between CS and the 
absence of US, rather than an erasure of the original CS-US asso- 
ciation’*. This theory is supported by behavioural studies of 
spontaneous recovery, renewal and reinstatement of fear memory after 
extinction’®. Furthermore, fear conditioning and extinction regulate 
activities of non-overlapping neuronal populations in the amygdala, 
hippocampus and frontolimbic cortex’”""°. Although these behavioural 
and electrophysiological studies suggest that fear memories are stored 
in different circuits from extinction memories, they do not exclude 
the possibility that extinction may cause a partial erasure of fear 
memory traces. In support of the second view, it has been found that 
spontaneous recovery of fear memories is minimal after extinction 
training in young animals'*. Furthermore, fear conditioning and 
extinction are accompanied by an increase or decrease in synaptic 
activity in amygdala'° and in the expression level of signalling 
molecules involved in memory formation'®’. 

To investigate how fear conditioning and extinction affect synaptic 
circuits to result in opposite behavioural responses, we used transcranial 


two-photon microscopy to examine the formation and elimination of 
postsynaptic dendritic spines of layer-V pyramidal neurons in the dorsal 
medial region of the frontal association cortex’?’*"? (FrA; Fig. la). We 
chose FrA to investigate synaptic changes associated with fear condi- 
tioning and extinction for the following reasons. First, consistent with 
previous reports in rats*’’’, we observed reciprocal connections 
between the mouse FrA and amygdala (Fig. 1b, c and Supplementary 
Figs 1 and 2), suggesting that this cortical region interacts directly with 
the amygdala to participate in fear learning and extinction. Second, 
consistent with findings in rats that FrA is important for fear memory 
consolidation”, we found that after fear conditioning or extinction, 
inactivation of the mouse FrA by muscimol injection impaired the 
consolidation of fear and extinction memories (Supplementary Fig. 3). 
Lastly, unlike prelimbic and infralimbic regions that are important for 
fear expression and extinction but located deep in the brain”**’, FrA is 
accessible for in vivo two-photon imaging of dendritic spine plasticity. 

Using two-photon microscopy and YFP-expressing transgenic 
mice'’””, we first examined whether fear conditioning affected spine 
formation and elimination over 48 h in FrA (Fig. 1d, f). In these experi- 
ments, one-month-old mice were imaged and then subjected to one 
of four stimulus conditions: three tones each paired with a co- 
terminating footshock; three tones each temporally unpaired from a 
footshock; three tones only; or three footshocks only. Forty-eight hours 
after fear conditioning, we put mice through a tone-cued recall test to 
assess conditioned freezing responses, followed by a second imaging 
session to examine spine dynamics. We found that only the CS-US 
paired group, but none of the other groups, showed robust freezing 
responses during the recall test (F(3,16) = 18.689, P< 0.001; Fig. le). 
Notably, only the paired group showed a significant increase in spine 
elimination after 48 h when compared with the unpaired, tone-only or 
shock-only groups (17.4 + 2.3% versus 9.7 + 1.9%, 10.1 + 1.1% or 
10.0% + 1.3%, respectively; F(3,16) = 24.569, P < 0.001; Fig. 1f; see also 
Supplementary Fig. 4). We did not observe a significant difference in 
spine formation among the four groups (Fis. 16) = 0.164, P > 0.9; Fig. 1f 
and Supplementary Fig. 4). There were also no significant differences 
in the formation and elimination rates of dendritic filopodia, that is, 
precursors of dendritic spines”, among different groups (Supplemen- 
tary Fig. 5). These results indicate that fear conditioning by CS-US 
association causes dendritic spine elimination over 2 d in FrA. 

To investigate the impact of fear conditioning further, we examined 
freezing responses and spine dynamics in the paired and unpaired 
groups 9 d after conditioning. A recall test showed that mice subjected 
to the paired CS-US stimuli showed a higher level of conditioned 
freezing responses than mice exposed to the unpaired CS-US stimuli 
(P < 0.001; Fig. 1g). Furthermore, spine elimination over 9 d was sig- 
nificantly higher in the paired group than in the unpaired control 
group (P < 0.05; Fig. 1h). No significant differences in spine formation 
were observed between the two groups (P > 0.05; Fig. 1h). Thus, spine 
elimination induced by fear conditioning represents a long-lasting 
synaptic change in FrA. In addition, we found a significant increase 
in spine elimination but not formation as early as 24h after fear 


1Molecular Neurobiology Program, Skirball Institute, Department of Physiology and Neuroscience, New York University School of Medicine, 540 First Avenue, New York, New York 10016, USA. 
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Figure 1 | Fear conditioning causes spine elimination. a, Diagram of a 
coronal section of frontal association cortex (FrA) showing the imaging site 
(cyan bar). PL, prelimbic cortex. b, c, Neurons labelled with Mini-Ruby dye 
(b, red) in FrA have extensive axonal arborizations in amygdala (c, red). The 
arrow and arrowhead in c respectively indicate yellow fluorescent protein 
(YFP)-positive and -negative neuronal soma labelled with Mini-Ruby in 
amygdala (see also Supplementary Fig. 1). d, Representative images of dendrites 
before and after conditioning in the unpaired and paired groups. Arrows and 
arrowheads indicate spine formation and elimination, respectively. Asterisks 
mark filopodia. e, Percentage of freezing in different groups before and during 


conditioning in FrA (Supplementary Fig. 6a). However, no significant 
increases in dendritic spine elimination or formation were observed 
24h after fear conditioning in the barrel cortex (Supplementary Fig. 6b). 
Together, these results indicate that fear conditioning causes rapid and 
long-lasting spine elimination in FrA, but not in all cortical regions. 

Notably, we observed that, 48 h after fear conditioning, the percentage 
of spine elimination correlated significantly with the degree of freezing 
responses to CS (r = 0.765 (correlation coefficient), P< 0.001; Fig. 1i). 
By contrast, there was no significant correlation between the degree of 
spine formation and freezing responses (r = —0.201, P > 0.3; Fig. 1)). 
Furthermore, 9 d after training, freezing responses in the paired group 
correlated significantly with spine elimination (r= 0.895, P< 0.05; 
Fig. 1k) but not spine formation (r= 0.055, P> 0.3; Fig. 11). These 
findings suggest that spine elimination induced by fear conditioning 
represents an important synaptic change that strongly predicts the con- 
ditioned freezing response. 

To determine whether fear extinction affects spine dynamics, we 
subjected mice to fear conditioning as before, followed by 2 d of extinc- 
tion training through repeated presentation of CS in the absence of 
footshocks (five trials per day) (Fig. 2a). Dendritic spines in FrA were 
imaged before and after extinction training, and spine dynamics were 
compared between fear-conditioned mice with and without extinc- 
tion. Consistent with the reported effectiveness of our extinction pro- 
tocol’, the conditioned freezing response was significantly reduced in 
the extinction group (P< 0.001) but not in the no-extinction group 
(P > 0.3; Fig. 2b). After 2 d of extinction, we found that spine forma- 
tion was significantly higher in the extinction group than in the no- 
extinction group (16.5 + 2.6% versus 8.3 + 1.2%, P< 0.001; Fig. 2c). 
By contrast, no significant difference in spine elimination was 
observed after extinction (P > 0.2; Fig. 2c). There were also no signifi- 
cant differences in the formation and elimination rates of dendritic 
filopodia after extinction (Supplementary Fig. 7). Notably, the freezing 
response to CS after extinction showed a significant inverse correlation 
with spine formation (r = —0.809, P < 0.01; Fig. 2d) but not with spine 
elimination (r = —0.250, P > 0.4; Fig. 2e). Thus, by contrast with fear 
conditioning, extinction induces new spine formation, the degree of 
which predicts the effectiveness of extinction training in reducing 
conditioned freezing responses. 
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CS presentation 48 h after conditioning. f, Percentage of spine elimination and 
formation 48 h after conditioning (n = 5 for each group). Only the paired group 
showed an increase in freezing response (e) and spine elimination 

(f). g, Freezing responses of the paired and unpaired groups 9 d after 
conditioning. h, Percentage of spine elimination and formation over 9 d. 
(Unpaired, n = 4; paired, n = 5.) i-l Freezing response correlated with spine 
elimination but not formation over either 48 h (i, j) or 9 d (k, I). Each circle in 
i-l represents an animal. ***P < 0.001, *P < 0.05. Data show mean + s.e.m. 
(e, g) or mean + s.d. (f, h). Scale bars: 100 [um (b, c); 4 um (d). 


A recent study has shown that a fraction of new spines induced by 
novel sensory and motor learning experiences persist over months’”. 
To determine whether new spines induced by extinction are long 
lasting, we subjected mice to fear conditioning followed by extinction 
training as described above. On day 12,7 d after extinction training, we 
performed a recall test to assess the extinction memory and then re- 
imaged mice to determine the persistence of spines formed over the 
2-d extinction training (Fig. 2a). We found that the acquired extinction 
memory was intact 7 d after extinction training (Fig. 2f). Furthermore, 
the percentage of new spines persisting until day 12 was inversely 
correlated with the freezing response to the CS (r= —0.899, 
P< 0.05; Fig. 2g). Thus, long-lasting new spines induced by extinction 
training may contribute to the preservation of the extinction memory. 

Our results so far indicate that fear conditioning predominantly 
promotes spine elimination whereas extinction mainly induces spine 
formation. To determine whether fear conditioning and extinction 
cause spine remodelling on the same or different dendritic branches, 
we measured the percentage of spine elimination after fear condition- 
ing and the percentage of spine formation after extinction along indi- 
vidual dendritic branches 15-50uum in length (average length, 
27.6 + 9.3 um). We found that the percentage of spine elimination 
after fear conditioning positively correlated with the percentage of 
spine formation after extinction (“CS/tone-A extinction’; n = 6 mice, 
40 branches, r = 0.458, P < 0.01; Fig. 3b, e). By contrast, no significant 
correlation was observed in the no-extinction group (n = 5 mice, 25 
branches, r = 0.240, P > 0.2; Fig. 3a, d). Furthermore, when a different 
cue (tone B) was presented repeatedly instead of the cue originally used 
for fear conditioning (tone A), no significant correlation was observed 
between spine elimination and formation (‘tone B’} nm =5 mice, 41 
branches, r = 0.231, P > 0.1; Fig. 3c, f). These results suggest that cue- 
specific extinction induces spine formation on the dendritic branches 
on which fear conditioning previously caused spine elimination. 

Further to understand how fear conditioning and extinction impact 
synaptic connectivity on the same dendritic branches, we measured 
the distance between the site of conditioning-induced spine elimina- 
tion and the closest site of extinction-induced spine formation. In the 
extinction group, 57.3 + 5.5% of extinction-induced new spines were 
located within 2 1m of either side of a spine that had previously been 
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Figure 2 | Fear extinction induces spine formation. a, Timeline of 
experimental manipulations and imaging. b, Percentage of freezing during the 
recall test and extinction training. Freezing in the extinction group (n = 10) was 
lower than the no-extinction group (m = 5, P< 0.001). ¢, Extinction 
significantly increased spine formation (P < 0.001). d, e, Percentage of freezing 
after extinction was inversely correlated with spine formation (d) but not 
elimination (e). Extinction training was performed on two consecutive days 
(Ext-1 and Ext-2). Freezing in the last trial of Ext-2 in b was plotted in d and 
e. f, Percentage of freezing in the recall tests and during the last trials of 
extinction on days 4 and 5. Decreased freezing after extinction persisted for 7 d 
as shown in the recall test (R-3). There was no significant difference in freezing 
between the last trial of Ext-2 and R-3 (n = 6, P> 0.1). g, New spines induced 
by extinction and persisting on day 7 inversely correlated with freezing 
response. ***P < 0.001. Data show mean = s.e.m. (b, f) or mean + s.d. (c). 


eliminated after conditioning. In the groups that did not undergo 
extinction or were exposed to tone B instead of the conditioned stimu- 
lus (CS/tone A), only 24.9+ 1.4% or 244+ 4.9% of new spines, 
respectively, were formed within 2 1m of spines previously eliminated 
by conditioning (Fig. 3g). Moreover, within a distance of 2 ,1m from 
sites of spine elimination, ~80% of newly formed spines in all groups 
were oriented within 90° of previously eliminated spines. Beyond 
2m, the likelihood of newly formed spines being oriented within 
90° of previously eliminated spines decreased to chance levels 
(~50%) (Fig. 3h). These results indicate that after extinction training, 
new spines tend to form within close proximity and orient in the same 
direction as spines that were eliminated after fear conditioning. 
Furthermore, we estimated that the number of presynaptic boutons 
available for a synaptic contact with a spine that is 2 um long and 
located along a 4-um dendritic segment is ~32 (Supplementary 
Information, section 1). When considering that new spines tend to 
be oriented in the same direction as previously eliminated spines, our 
results indicate that there could be a ~1/16 chance that new spines will 
contact the same synaptic boutons as previously eliminated spines. 
So far, our data have indicated that extinction with the US-associated 
conditioned stimulus (tone A), but not with an unconditioned stimulus 
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Figure 3 | Fear conditioning and extinction cause location-specific spine 
remodelling. a-c, Correlations between spine elimination 48 h after fear 
conditioning and spine formation on individual dendritic branches under the 
following three conditions: no extinction training (a); after extinction training 
using the conditioned tone (b); after repeated exposure to a novel tone 

(c). d-f, Representative images of dendritic branches taken before and after 
conditioning from the three groups. Arrowheads mark sites of spine 
elimination induced by conditioning. Arrows mark sites of newly formed 
spines under the three conditions. The red dot in e marks a new spine formed 
after extinction and located within 2 um of a spine previously eliminated after 
conditioning. Asterisks mark filopodia. Scale bar in f, 4 um. g, Spine 
distribution graph depicting the relative distance between sites of spine 
elimination induced by conditioning and the respective closest sites of spine 
formation under the three conditions. h, Percentage of newly formed spines 
facing the same direction as previously eliminated spines (oriented within 90° 
relative to the eliminated spines). ***P < 0.001, **P < 0.01. Data show 
mean = s.e.m. (g). 


(tone B), induces spine formation in close proximity to spines that 
were eliminated after fear conditioning. Further to investigate the 
cue specificity and location specificity of extinction-induced spine 
formation, we fear-conditioned mice to tone A (CS1) and tone B 
(CS2) in two consecutive training sessions using three pairings of 
CS1 with US and CS2 with US (Fig. 4a). Eliminated spines induced 
by CS1-US or CS2-US pairings were identified separately over 4 d. In 
recall tests on day 2 (R-1) and day 4 (R-2), mice showed freezing 
responses to both CS1 and CS2. We then subjected mice to extinction 
training with CS2 for another 2 d and newly formed spines induced by 
extinction were identified (Fig. 4a). On day 7, after CS2 extinction, 
mice showed a low freezing response to CS2 but a high freezing res- 
ponse to CS1, demonstrating the cue specificity of our extinction train- 
ing (Fig. 4b). Consistent with our results in Figs 1 and 2, fear 
conditioning with both CS1 and CS2 increased spine elimination 
whereas extinction with CS2 promoted spine formation (Fig. 4c). 
Notably, a significantly larger population of newly formed spines 
induced by extinction with CS2 were located within 2 1m of spines 
that were eliminated by CS2-US (‘CS2 elim/CS2 form’) than by CS1- 
US (‘CS1 elim/CS2 form’) (45.9 + 7.3% versus 26.6 + 3.8%, P< 0.05; 
Fig. 4d). These findings indicate that extinction training with a specific 
auditory cue induces spine formation in close proximity to spines 
previously eliminated after fear conditioning against the same cue. 
Further to investigate the cue and location specificity of opposing 
synaptic changes after fear conditioning and extinction, we tested 
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Figure 4 | Extinction induces spine formation in a cue- and location-specific 
manner. a, Timeline of experimental manipulations and imaging. b, Percentage 
of freezing during the recall test and extinction. After extinction with CS2, mice 
showed a lower freezing response to CS2 than to CS1 on day 7 (P< 0.001). 

c, Conditioning to either CS1 or CS2 increased spine elimination when 
compared with unpaired controls (P < 0.001). Extinction with CS2 increased 
spine formation when compared with the unpaired controls, CS1-US or CS2-US 
(Fa,16) = 18.540, P< 0.001). d, Spine distribution graph depicting the relative 
distance between new spines induced by CS2 extinction and eliminated spines 
induced by CS1-US or CS2-US. A significantly larger population of new spines 
induced by CS2 extinction were located within 2 |1m of spines previously 
eliminated by CS2-US than were located within 2 1m of spines previously 
eliminated by CS1-US (n = 5 mice, 36 branches, 74 spines, P< 0.05). 

***D < ().001, *P < 0.05. Data show mean = s.e.m. (b, d) or mean = s.d. (c). 


whether new spines induced by extinction are selectively eliminated in 
a cue-specific manner after reconditioning. In this experiment, we first 
subjected mice to fear conditioning and extinction, and then recondi- 
tioned them by re-exposure to five CS1-US (CS1/tone A) pairings on 
days 5 and 6 (Fig. 5a). As control groups, mice were either subjected to 
five temporally unpaired presentations of CS1 and US stimuli or con- 
ditioned with five pairings of CS2-US pairings (CS2/tone B) on days 5 
and 6 (Fig. 5a-d). Recall tests showed a significant increase of the 
freezing response to CS1 in the reconditioning group when compared 
with the unpaired group or the group conditioned with CS2-US 
(P< 0.01; Fig. 5e). We next compared the respective persistences of 
newly formed spines located within 21m or more than 2 um from 
previously eliminated spines in the three groups. The persistence of 
extinction-induced new spines within the 2-11m boundaries in the 
reconditioning group (11.1 + 1.8%) was significantly lower than that 
in the unpaired or CS2-US conditioned groups (32.0 + 6.2% and 
35.2 + 11.3%, respectively) (P< 0.05). There was no significant differ- 
ence in the persistence of spines formed outside the 2-j1m boundary 
among the three groups (P > 0.6; Fig. 5f). Thus, fear reconditioning 
opposes the effects of extinction on synaptic remodelling with high 
anatomical and cue specificity. 

It is well accepted that synaptic reorganization is critical for learning 
and memory'*’. However, it is unclear how synaptic circuits are 
modified by opposing forms of learning and how such modifications 
contribute to opposite behavioural outcomes. Our studies indicate that 
fear conditioning, extinction and reconditioning cause opposing synaptic 
modifications on the same dendritic branches in a cue- and location- 
specific manner. These findings also suggest that extinction induces 
at least partial erasure of fear memory traces in FrA. FrA in rodents 
has reciprocal connections with multiple brain areas including the 
amygdala’, and its inactivation impairs fear learning and extinc- 
tion” (Fig. 1b, c and Supplementary Figs 1-3), suggesting that this 
region is directly involved in modulating freezing behaviours. Future 
investigations are needed to understand the mechanisms underlying 
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Figure 5 | Reconditioning eliminates spines formed during extinction. 

a, Timeline of experimental manipulations and imaging. b-d, Representative 
images before and after fear conditioning, after extinction and after 
presentation of the unpaired stimuli (b), after reconditioning to CS1 (c) or after 
new conditioning to CS2 (d). Arrowheads mark sites of spine elimination after 
fear conditioning. Arrows and red dots mark new spines that were formed after 
extinction and located more than 2 |im (arrows) and within 2 jum distance (red 
dots) from previously eliminated spines after conditioning. Asterisks mark 
filopodia. Scale bar, 4 1m. e, Reconditioning (RC) increased freezing by 
comparison with unpaired stimuli (UP) or new conditioning to CS2 (CS2-US) 
during recall test R-2 with CS1 (**P < 0.01). f, The survival rate of new spines 
induced by extinction training and located within 2 1m of previously 
eliminated spines was significantly lower after RC-CS1 (54 branches, 107 total 
new spines) than after UP (58 branches, 107 total new spines) or CS2-US (54 
branches, 94 total new spines) (**P < 0.01, *P < 0.05). Data show 

mean = s.e.m. (e) or mean = s.d. (f). 


the opposite changes of synaptic connections in FrA and how such 
changes contribute to the acquisition, extinction and reinstatement of 
fear memories. 


METHODS SUMMARY 


One-month-old male mice expressing YFP (H-line) were used in this study. Fear 
conditioning was conducted with three pairings of a 30-s, 80-dB auditory cue 
(400 Hz; tone A= CS1 or 1,200 Hz, tone B = CS2) co-terminating with a 2-s, 
0.5-mA scrambled footshock (US). Extinction training was conducted with five 
CS presentations (each CS lasting 2 min with an intertrial interval of 2 min) per day 
for 2d. The procedures of in vivo transcranial two-photon imaging and data 
quantification were as described previously'’”*. Either analysis of variance or 
Student’s t-test was used to compare spine remodelling and freezing responses 
among different groups. The Pearson correlation coefficient was used to measure 
the strength of linear dependence between different variables. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 


Animals. C57BL/6 mice expressing YFP in layer-V pyramidal neurons (H-line) 
were purchased from the Jackson Laboratory and group-housed in the Skirball 
animal facility. One-month-old (P30 + 1) male mice were used in the experi- 
ments. All experiments were approved and performed in accordance with insti- 
tutional guidelines. 

Fear conditioning and extinction. Apparatus. Mice were trained and tested using 
the FreezeFrame system (Coulbourn Instruments). For training, mouse test cages 
equipped with stainless-steel shocking grids were connected to a precision 
feedback current-regulated shocker (Coulbourn Instruments). For testing, the 
shocking grids were replaced with non-shocking test grids that differed in texture 
from the shocking grids used during conditioning. Each test cage was contained in 
a sound-attenuating enclosure (Coulbourn Instruments). Behaviour was recorded 
using low-light video cameras. Stimulus presentation was automated using 
Actimetrics FreezeFrame software (version 2.2; Coulbourn Instruments). All 
equipment was thoroughly cleaned with detergent followed by water between 
sessions. 

Fear conditioning. Mice were habituated for 2 min on a shocking grid (cage 
set-up A: shocking floor grids, ethanol scent). Fear conditioning was conducted with 
three pairings of a 30-s, 400-Hz, 80-dB auditory cue (tone A = CS1) co-terminating 
with a 2-s, 0.5-mA scrambled footshock (US). The intertrial interval was 15 s. Two 
minutes after conditioning, mice were returned to their home cages. For the 
unpaired control group, mice received tones and shocks in an unpaired manner 
(tones and shocks were separated by random intervals of 5-15s). Mice were 
returned to their home cages 2 min after presentation of the unpaired stimuli. For 
tone-only or shock-only control groups, mice were habituated for the same amount 
of time in the testing context, and tones or footshocks were given separately. 

Recall test and extinction. For the recall test, mice were placed in a different 
context (cage set-up B: test floor grids, 1% Pinesol) for an initial 2-min (pre-CS) 
period and this was followed by tone presentation for 2 min (CS). For extinction 
training, mice were subjected to five CS presentations (each lasting 2 min with an 
intertrial interval of 2 min) per day for two consecutive days. For repeated exposure 
of mice to tone B (C82), the extinction protocol was used with a different auditory 
cue (1,200 Hz, 80 dB). 

Reconditioning. Mice were reconditioned in the training context using shock- 
ing grids (ethanol scent) with five pairings of CS-US (30-s CS/tone A, 400 Hz; each 
co-terminated with a 2-s footshock) each day on days 5 and 6. For the unpaired 
control, five shocks and tones were given to mice in an unpaired manner each day. 
For conditioning with tone B, five CS—US pairings were given to the mice using 
tone B as the auditory cue each day. 

Retrograde and anterograde tracing with Mini-Ruby. One-month-old male mice 
expressing YFP were anaesthetized with ketamine and xylazine (intraperitoneal; 
20mg ml ' and 3 mg ml’, respectively, in saline; 6 il per gram of body weight). 
Mini-Ruby (Invitrogen) dissolved at 5% concentration in water was injected into 
the frontal association cortex (+2.8mm bregma, +1.0mm midline, +0.01 mm 
ventral) through a sharp electrode by iontophoresis (6 1A, on-off for 15 min). 
For amygdala injections, 0.15 il of Mini-Ruby was injected over 5 min into the 
amygdala with a Hamilton syringe (—1.94mm bregma, +3.00mm midline, 
+4.75 mm ventral). Four to seven days after the injection, mice were perfused with 
4% paraformaldehyde and their brains were postfixed overnight. Brains were 
sectioned with a vibratome at 200 tim. Confocal images were acquired using a 
Bio-Rad confocal microscope (X20 oil lens; numerical aperture, 0.8). 

Transient inactivation of the frontal association cortex with muscimol. After 
fear conditioning or extinction, mice were anaesthetized with ketamine and 
xylazine. Muscimol (Sigma; 0.5 pl at 1 1g pl’) in artificial cerebrospinal fluid or 
artificial cerebrospinal fluid as vehicle was microinjected bilaterally into the frontal 
association cortex (+2.8 mm bregma, +1.0 mm midline, +0.5 mm ventral) with a 
pressure injection device (Picospritzer III; 15 p.s.i., 12 ms, 0.8 Hz) over 5 min. The 
injection was performed within 1h after fear conditioning or extinction. Twenty- 
four hours after injection, mice were subjected to a recall test. Muscimol spread 
was estimated by the line at which the Mini-Ruby fluorescence was less than 20% 
of its peak level. On the basis of this definition, we determined the range of 
muscimol spread as ~500-700 tm (Supplementary Fig. 3). 


In vivo transcranial two-photon imaging. Spine formation and elimination were 
examined by imaging the mouse cortex through a thinned-skull window as 
described previously'’”’. Briefly, one-month-old male mice expressing YFP were 
anaesthetized with ketamine and xylazine (intraperitoneal; 20mgml_' and 
3mg ml’, respectively, in saline; 6 jl per gram of body weight). Thinned-skull 
windows were made in head-fixed mice with high-speed microdrills. Skull 
thickness was reduced to about 201m. A two-photon microscope tuned to 
920 nm (X60 water immersion lens; numerical aperture, 1.1) was used to acquire 
images. For re-imaging of the same region, thinned regions were identified on the 
basis of the maps of the brain vasculature. Microsurgical blades were used to re- 
thin the region of interest until a clear image could be obtained. The area of the 
imaging region was 200 um X 200 pm. The centres of the imaging regions were as 
follows: +2.8mm bregma, +1.0mm midline (frontal association cortex); 
—1.1mm bregma, +3.4mm midline (barrel cortex). Because repeated thinning 
of the skull to ~20 jum without damaging the cortex becomes more difficult after 
multiple imaging sessions, we designed our experiments in such a way that no 
animal was imaged than four times. 

Data analysis. All data analysis was performed blind to treatment conditions, 
except for the data in Supplementary Fig. 4, which were collected under behavioural 
treatment conditions known to the investigator. Image J software was used to 
analyse spine elimination and formation from three-dimensional image stacks as 
described previously''’’. Dendritic branches were randomly sampled within a 
200 jum X 200 jum area imaged at a distance of 0-100 j1m below the pia surface. 
The same dendritic segments were identified from three-dimensional stacks taken at 
different time points with high image quality (ratio of signal to background noise, 
>4:1). The number and location of dendritic protrusions (protrusion lengths were 
more than one-third of the dendritic shaft diameter) were identified. The percentage 
of spine formation and elimination is presented as the number of spines formed or 
eliminated between the first and second view divided by the total number of spines 
observed at the first view’””” (Supplementary Information, section 2). 

For the analysis of spine changes at the individual branch level (Fig. 3), the 
percentage of spine formation and elimination is presented as the number of 
formed or eliminated spines divided by the total number of spines on the indi- 
vidual branch. For the spine orientation analysis, newly formed spines were con- 
sidered to have the same orientation when they were oriented within 90° relative to 
previously eliminated spines. 

Filopodia were identified as long, thin structures'"”” (generally larger than twice 
the average spine length; ratio of head diameter to neck diameter, <1.2:1; ratio of 
length to neck diameter, >3:1). The remaining protrusions were classified as spines. 
No subtypes of spines were distinguished in our analysis. Three-dimensional stacks 
were used to ensure that tissue movements and rotation between imaging intervals 
did not hinder spine identification. Spines or filopodia were considered to be 
identical between views if their positions were unchanged with respect to adjacent 
landmarks. Our quantitative analysis shows that we can measure the distance 
between two adjacent stable spines with a precision of ~0.2 lum in 95% of the cases 
(2s.d.). Spines were considered different if their positions differed from the first 
view by more than 0.7 um. We chose 0.7 um as a cut-off distance because spine 
positions can shift by up to ~0.3 jum in either direction along the axis of dendritic 
shafts owing to changes in spine morphology, slight tissue rotation and movements 
related to brain pulsation. We estimate that the imaging resolution of our two- 
photon microscope (60; numerical aperture, 1.1; 920 nm) is ~0.7 um. 

For image display, fluorescent structures near and out of the focal plane of the 
dendrites of interest were removed manually from image stacks using Adobe 
Photoshop. The modified image stacks were then projected to generate two- 
dimensional images and adjusted for contrast and brightness. 

For statistical analysis, we used either analysis of variance or Student's t-test to 
compare spine formation and elimination rates among different experimental 
groups. All spine formation and elimination rates are presented as mean + s.d. 
and all behavioural data are presented as mean + s.e.m. Data in spine distribution 
graphs in Fig. 3g and 4d are presented as mean + s.e.m. The Pearson correlation 
coefficient was used as a measure of the strength of linear dependence between 
spine changes and behavioural responses. The linear regression lines and correla- 
tion coefficients (r) are shown in Fig. 1i-l, Fig. 2d, e, g and Fig. 3a-c. In all analyses, 
P values less than 0.05 were considered to be statistically significant. 
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Free fatty acids provide an important energy source as nutrients, 
and act as signalling molecules in various cellular processes’. 
Several G-protein-coupled receptors have been identified as free- 
fatty-acid receptors important in physiology as well as in several 
diseases**-"*, GPR120 (also known as O3FAR1) functions as a 
receptor for unsaturated long-chain free fatty acids and has a critical 
role in various physiological homeostasis mechanisms such as 
adipogenesis, regulation of appetite and food preference*®*"*. 
Here we show that GPR120-deficient mice fed a high-fat diet 
develop obesity, glucose intolerance and fatty liver with decreased 
adipocyte differentiation and lipogenesis and enhanced hepatic 
lipogenesis. Insulin resistance in such mice is associated with 
reduced insulin signalling and enhanced inflammation in adipose 
tissue. In human, we show that GPR120 expression in adipose tissue 
is significantly higher in obese individuals than in lean controls. 
GPR120 exon sequencing in obese subjects reveals a deleterious 
non-synonymous mutation (p.R270H) that inhibits GPR120 sig- 
nalling activity. Furthermore, the p.R270H variant increases the 
risk of obesity in European populations. Overall, this study demon- 
strates that the lipid sensor GPR120 has a key role in sensing dietary 
fat and, therefore, in the control of energy balance in both humans 
and rodents. 

To investigate the role of GPR120 in metabolism, we examined 
GPR120-deficient mice (Supplementary Fig. 1) with respect to lipogen- 
esis, glucose and energy homeostasis. On a normal diet containing 13% 
fat, the body weight was similar in both GPR120-deficient and wild- 
type mice. However, when 5-week-old GPR120-deficient mice were fed 
a high-fat diet (HFD) containing 60% fat, their body weight increase 
was ~10% higher than that of wild-type mice on a HFD (Fig. 1a). The 
difference in HFD-induced body weight gain between wild-type and 
GPR120-deficient mice was marked at ~8-10 weeks old and reached a 
plateau at 13 weeks old. To assess energy expenditure and substrate 
utilization, we next performed indirect calorimetry on wild-type and 


mutant mice on a HFD at 9-10 weeks old (Fig. 1b) and 15-16 weeks old 
(Supplementary Fig. 2a). The young GPR120-deficient mice showed 
decreased energy expenditure compared with the young wild-type 
mice, particularly during the light/inactive phase (Fig. 1b, left), whereas 
older mutant and wild-type mice showed no such a difference 
(Supplementary Fig. 2a, left). The difference in energy expenditure 
between GPR120-deficient and wild-type mice was observed only in 
the light phase, indicating that the lack of the GPR120 receptor 
primarily affects basal metabolism, especially in young mice. The 
decreased energy expenditure might explain the difference we found 
in body weight gain between HFD-fed wild-type and mutant young 
mice. The lower values of respiratory quotient in mutant mice could be 
due to insufficient glucose utility, probably as a result of the decreased 
insulin sensitivity. In all experiments, both groups of mice showed 
similar levels of locomotor activity (data not shown). 

White adipose tissue (WAT) and liver were substantially heavier in 
HFD-fed GPR120-deficient mice (Supplementary Fig. 2b). Plasma 
low- and high-density lipoprotein cholesterol levels were significantly 
higher in HFD-fed GPR120-deficient mice, and serum alanine amino- 
transferase levels were substantially increased, indicating abnormal 
cholesterol metabolism and liver function (Supplementary Table 1). 
Microcomputed tomography scanning revealed that 16-week-old 
GPR120-deficient mice stored much more fat than did wild type 
(Fig. 1c). A significant increase in adipocyte size in both epididymal 
(Fig. 1d) and subcutaneous (Supplementary Fig. 2c) fat was observed 
in GPR120-deficient mice. Furthermore, the expression of macro- 
phage marker genes (Cd11b (Itgam), Cd68 and F4/80 (Emr1)) and 
the number of F4/80-positive cells were markedly enhanced in epidi- 
dymal tissue from HFD-fed GPR120-deficient mice (Fig. le, f). 
Moreover, these mice showed liver steatosis and hepatic triglyceride 
content was markedly increased (Fig. 1g). Overall, HFD-induced obesity 
and liver fattiness were more severe in GPR120-deficient mice than in 


wild type. 
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Obesity-associated insulin resistance was also more severe in 
GPR120-deficient mice. HFD-fed GPR120-deficient mice showed 
higher levels of fasting plasma glucose and insulin than did wild type, 
although these parameters were similar between the two groups on a 
normal diet (Fig. 2a). HFD-induced insulin resistance, as determined 
by an insulin tolerance test, was more prominent in GPR120-deficient 
mice than in wild type (Fig. 2b, left, and Supplementary Fig. 3a, b). 
A glucose tolerance test further revealed that these mice suffered 
from impaired glucose metabolism (Fig. 2b, right, and Supplementary 


Figure 1 | Obesity, hypertrophic adipocytes, accumulation of pro- 
inflammatory macrophages and hepatic steatosis in HFD-fed GPR120- 
deficient mice. a, Body weight changes of wild-type (WT) and GPR120- 
deficient mice fed a normal diet (ND) or a HED (n = 36-47). b, Indirect 
calorimetry in HFD-fed mice. Energy expenditure and respiratory quotient 
(n = 4, 5). c, Representative cross-sectional images of wild-type and GPR120- 
deficient mice subjected to microcomputed tomography analysis of the in situ 
accumulation of fat. Fat depots are demarcated (green) for illustration. 

d, Haematoxylin and eosin (H&E)-stained epididymal WAT and mean area of 
adipocytes (n = 6). Scale bar, 100 um. e, Relative expression of Cd11b, Cd68 and 
F4/80 messenger RNA in WAT (n = 6). a.u., arbitrary units. f, Representative 
images of epididymal WAT stained with anti-F4/80 antibody (arrows, F4/80- 
positive cells) and the number of F4/80 cells (n = 6). Scale bar, 100 jum. g, Oil 
Red O-stained liver and hepatic triglyceride content after 24hr fasting (n = 13). 
Scale bar, 50 jtm. All data represent mean + s.e.m. *P < 0.05 and **P <0.01 
versus the corresponding wild-type value. 


Fig. 3a, b). The level of plasma leptin was significantly higher in HFD- 
fed GPR120-deficient mice than in wild type (Supplementary Fig. 3c). 
However, there was no significant difference in terms of plasma 
adiponectin level or food intake between the two groups (Supplemen- 
tary Fig. 3d, e). HFD-fed GPR120-deficient mice showed a marked 
increase in the size of islets and KI67 (MKI67)-positive cells, suggesting 
adaptive enlargement of the B-cell mass in response to insulin resist- 
ance’”'* (Supplementary Fig. 3f, g). Moreover, we observed markedly 
reduced peripheral insulin sensitivity in tissues from HFD-fed GPR120- 
deficient mice (Fig. 2c). Insulin was shown to induce the phosphoryla- 
tion of AKT (AKT1) (on Ser 473) in WAT, liver and skeletal muscle, 
with similar intensities in wild-type and GPR120-deficient mice on a 
normal diet (Supplementary Fig. 3h). Consistent with the insulin res- 
istance reported above, HFD-fed GPR120-deficient mice showed loss of 
insulin-induced AKT phosphorylation in WAT and the liver. 

To determine the molecular basis of the metabolic changes in WAT 
and livers of GPR120-deficient mice, we performed gene expression 
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Figure 2 | Impaired glucose metabolism, adipogenesis and lipogenesis in 
HFD-fed GPR120-deficient mice. a, Fasting blood glucose and serum insulin 
levels (n = 6-15). b, Plasma glucose during insulin tolerance test (ITT, left) and 
glucose tolerance test (GTT, right) (n = 12-14). ¢, Phosphorylation of AKT 
(Ser 473) in WAT, liver and skeletal muscle after 24-hr fasting (n = 6, 7). 

NS, not significant. d, Relative mRNA expression of Fabp4 and Scd1 in WAT or 
Scd1 in liver (n = 6). e, Protein expression of IRB, IRS1, IRS2, SCD 1 and f-actin 
in WAT. f, Protein expression of IRS1, IRS2, SCD1 and B-actin in liver. 

g, Oil Red O-staining and triglyceride (TG) content of mouse embryonic 
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fibroblast (MEF)-derived adipocyte. Scale bar, 50 um. h, Relative mRNA 
expression in MEF-derived adipocyte (n = 6). i, The ratio of C18:1 to C18:0 in 
livers (n = 6-8). j, Non-esterified C16:1n7 palmitoleate in WAT and plasma 
(n = 4-7). k, The ratio of Scd1 mRNA expression in liver and WAT (n = 6, 7). 
1, The ratio of C16:1 to C16:0 in adipose tissues (n = 6-8). m, Hepatic Scd1 
mRNA expression in mice infused with vehicle or triglyceride:palmitoleate for 
6h (n= 4, 5). All data represent mean + s.e.m. *P < 0.05 and **P < 0.01 
versus the corresponding wild-type value. 
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analyses. We identified approximately 700 differentially expressed 
genes in WAT between HFD-fed GPR120-deficient and wild-type 
mice (Supplementary Fig. 4a). Connectivity mapping of these genes 
showed that pathways relating to insulin signalling and adipocyte 
differentiation were depressed, whereas those related to inflammation 
were enhanced in HFD-fed GPR120-deficient mice (Supplementary 
Fig. 4b). Quantitative real-time PCR (qRT-PCR) analysis confirmed 
the downregulation ofinsulin-signalling-related genes (Insr, Irs1 and Irs2), 
an adipocyte differentiation marker gene (Fabp4) and a lipogenesis- 
related gene (Scd1) in the epididymal fat from HFD-fed GPR120- 
deficient mice (Fig. 2d and Supplementary Fig. 3i). We identified 
approximately 100 differentially expressed genes in the liver between 
HFD-fed GPR120-deficient and wild-type mice (Supplementary Fig. 5). 
Notably, lipogenesis-related genes (Scd1 and Me1) and a fatty acid 
transporter gene (Cd36) were significantly upregulated in livers from 
GPR120-deficient mice. Quantitative RT-PCR analysis confirmed 
upregulation of Scd1 in the liver of GPR120-deficient mice (Fig. 2d). 

Western blot analysis further confirmed downregulation of IRf, 
IRS1 and SCD1 in adipose tissue of HFD-fed GPR120-deficient mice 
(Fig. 2e) but downregulation of IRS1 and IRS2 and upregulation of 
SCD1 in their livers (Fig. 2f). Hence, insulin-signalling-related molecules 
were downregulated by the lack of GPR120 in both adipose tissue and 
the liver. However, the expression of SCD1, the rate-limiting enzyme in 
the biosynthesis of mono-unsaturated fatty acids, was downregulated in 
adipose tissue but upregulated in liver. Furthermore, the expression of 
Scd1 and several adipogenic genes'*”” (Pparg, Fabp4 and Srebf1) was 
suppressed in mouse-embryonic-fibroblast-derived adipocytes from 
GPR120-deficient mice, indicating that GPR120 is required for normal 
adipogenesis, as previously reported in differentiating 3T3-L1 adipocytes 
depleted of endogenous GPR120 by short interfering RNA“ (Fig. 2g, h). 

To determine the effects of altered lipogenesis on lipid composition 
in GPR120-deficient mice, we performed lipidomics analysis in WAT, 
livers and plasma. Significant changes of major lipid clusters were 
observed (Supplementary Fig. 6). Notably, the hepatic concentration 
of oleate (C18:1n9c) was significantly higher in HFD-fed GPR120- 
deficient mice than in wild type. The ratio of C18:1 to C18:0, an 
indicator of SCD1 enzyme activity”, was markedly enhanced in 
livers from HFD-fed GPR120-deficient mice relative to wild type 
(Fig. 2i). Moreover, the levels of C16:1n7 palmitoleate, which has 
recently been recognized as a lipid hormone’, in WAT and plasma 
were significantly lower in HFD-fed GPR120-deficient mice than in 
wild type (Fig. 2j). In particular, lower levels of C16:1n7 palmitoleate 
were detected even in WAT of GPR120-deficient mice on a normal 
diet (Fig. 2j), which seems to be in good agreement with the suppressed 
Scd1 expression and the reduced SCD1 desaturation index’ (C16:1/ 
C16:0; Fig. 2k, I). Lipidomics analysis clearly illustrated dysregulated 
lipogenesis in GPR120-deficient mice, and showed the reduced pro- 
duction of lipid hormone C16:1n7 palmitoleate*. To determine 
whether the enhanced hepatic lipogenesis in GPR120-deficient mice 
was due to the reduced levels of C16:1n7 palmitoleate, we examined 
the effect of C16:1n7 palmitoleate treatment on hepatic Scd1 expres- 
sion. A 6-h infusion of triglyceride:palmitoleate markedly lowered the 
enhanced hepatic Scd1 expression in GPR120-deficient mice (Fig. 2m). 
The results indicated that the reduced C16:1n7 palmitoleate may 
explain the systemic metabolic disorders observed in GPRI120- 
deficient mice on a HFD, as palmitoleate has been proposed to be a 
bioactive lipid by which adipose tissue communicates with distant 
organs (such as liver) and regulates systemic metabolic homeostasis’. 
This study shows that dysfunction of GPR120 can be an underlying 
mechanism for diet-associated obesity and obesity-related metabolic 
disorders in mouse. 

The mouse data prompted us to assess the potential contribution of 
GPR120 to the development of obesity and its metabolic complications 
in humans. First, the expression levels of GPR120 in both subcutaneous 
and omental adipose tissues as well as in liver samples were compared 
between lean and obese subjects. Normoglycaemic obese patients and 


LETTER 


lean individuals (n = 14 in each group) were matched for age and 
gender (Supplementary Table 2). As previously described*"*, we con- 
firmed that GRP120 is barely expressed in liver of either lean or obese 
subjects (data not shown). By contrast, we found that GPR120 is well 
expressed in the adipose tissue of lean individuals (Fig. 3a). In addi- 
tion, human obesity is significantly associated with an increase in 
GPR120 expression in both subcutaneous and omental adipose tissues 
(1.8-fold increase; P = 0.0004 and P = 0.003, respectively). We also 
found that GPR120 expression in subcutaneous adipose tissue 
strongly correlates with that in omental adipose tissue (Spearman 
analysis; r = 0.570 and P = 2.74 X 10°), suggesting a systemic regu- 
lation of its expression in humans. Furthermore, we found a positive 
correlation between GPR120 expression in both subcutaneous and 
omental adipose tissues and in subjects’ concentrations of plasma 
low-density lipoproteins (on adjustment for age and sex; r = 0.247, 
P= 0.0115 and r= 0.255, P = 0.0118, respectively). 

To investigate the contribution of GPR120 to human obesity, the four 
GPRI120 exons were sequenced in 312 French, non-consanguineous, 
extremely obese children and adults (Supplementary Table 3). We 
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Figure 3 | GPR120 expression in human obese tissue samples, and effect of 
GPR120 variants on [Ca**]; response and GLP-1 secretion. a, GPR120 
mRNA levels in human subcutaneous (SC) and omental (OM) adipose tissues of 
lean (LN; n = 14) and obese (OB; n = 14) normoglycaemic individuals. Mann- 
Whitney analysis, ***P = 0.0004 and **P = 0.003. b, ALA-induced [Ca?*], 
responses in cells expressing wild-type GPR120 or a p.R67C or p.R270H variant. 
c, ALA-induced GLP-1 secretion in NCI-H716 cells expressing a wild-type 
GPR120, a p.R67C ora p.R270H receptor. d, Effect of transfection with GPR120 
variants on ALA-induced [Ca”*]; response in cells stably expressing wild-type 
GPR120. e, Effect of co-expression of human GPR120 p.R270H variant with 
wild-type GPR120 on ALA-induced [Ca?*]; response. Top: schematic diagram 
of constructs. Bottom: expression of wild type and p.R270H (left), and 
concentration-[Ca*” ]; response for ALA in cells expressing wild-type/wild- 
type, wild-type/R270H or R270H/wild-type receptors (right). **P < 0.01 versus 
the corresponding control value. RFI, relative fluorescence intensity; RFU, 
relative fluorescence unit. All data show mean + s.e.m. 
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Table 1 | Identified variants in GPR120 exons and association between the p.R67C/rs6186610 or p.R270H non-synonymous variant and 


obesity 
Variant Nucleotide Chr 10 MAF, MAF2 MAF2 Adjustment: age and gender Adjustment: age, gender and geography 
change position (controls, (cases, 
n=7,654) n=6,942) OR [95% Cl] P OR [95% Cl] P 

Missense p.R67C/rs6186610 CT 95,316,666 0.05 0.043 0.055 1.16 [1.02, 1.31] 0.022 1.13'[1.00, 1.28] 0.060 
variants p.R270H GA 95,337,031 0.03 0.013 0.024 1.62 [1.31, 2.00] 8.00 x 10°© 1.58 (1.28, 1.95] 2.17 x 107° 
Synonymous p.V38V GSA 95,316,581 0.0016 _ = _ — —_— = 
variants p.S192S GA 95,325,846 0.0016 — = = = = = 

p.V243V C3T 95,328,938 0.0016 —_ —_— _— _ _— _— 

p.S264S GA 95,337,014 0.0016 — a = —_ — = 


Variant position was indicated according to human genome build NCBI36/hg18. Association between p.R67C/rs6186610 or p.R270H variant and obesity was assessed by using a logistic regression adjusted for 
age and gender or for age, gender and geography, under an additive model. Chr, chromosome; MAF, minor allele frequency in sequencing data set (n = 312 extremely obese individuals); MAF2, minor allele 


frequency in the large obesity case-control genotyping data set; OR, odds ratio; Cl, confidence interval. 


identified only two non-synonymous variants, R270H (minor allele 
frequency (MAF), ~3%) and p.R67C/rs6186610 (MAF, ~5%), and 
four rare synonymous variants (MAF, <1%) (Table 1). The two non- 
synonymous variants were subsequently genotyped in 6,942 unrelated 
obese individuals and 7,654 control subjects, all of European origin 
(Supplementary Table 4). By using a logistic regression model adjusted 
for age and sex, we found that R270H associated with obesity under an 
additive model (OR=1.62 [1.31, 2.00]os50, (odds ratio and 95% 
confidence interval), P = 8.00 X 10 °; Table 1); whereas we found 
only a trend for association between p.R67C and obesity (OR= 
1.16 [1.02, 1.31]g50%; P = 0.022; Table 1). It is noteworthy that these 
results were almost the same after adjusting for geographical origin 
(Table 1). 

We then genotyped the p.R270H variant in 1,109 French pedigrees 
selected for obesity (n = 5,045) and in 780 German trios with one 
obese child (n = 2,340). We observed a significant over-transmission 
of the p.R270H low-frequency variant to obese offspring in 117 
pedigrees or trios where the p.R270H variant was present (transmis- 
sion, 62%; P = 0.009; Supplementary Table 5). This family-based study 
excludes a hidden population stratification effect as a cause of spurious 
association. 

Weassessed the functional significance of both the p.R67C mutation 
and the p.R270H mutation in silico using several programs: arginine 
residues at positions 67 and 270 presented a high-evolutionary- 
conservation pattern among mammals and the two amino-acid sub- 
stitutions were predicted to be potentially damaging (Supplementary 
Table 6). To examine the influences of the two non-synonymous var- 
iants on GPR120 function in vitro, we assessed each receptor ability to 
mobilize intracellular calcium (concentration, [Ca**];) in response to 
the endogenous agonist o-linolenic acid (ALA). We found that ALA 
induced [Ca**], responses in T-REx 293 cells expressing either wild- 
type or p.R67C receptor in a dose-dependent manner, whereas 
ALA-induced [Ca’*]; responses in cells expressing p.R270H were sig- 
nificantly lower (P = 1.6 X 10° °) than those in cells expressing wild 
type at ALA concentrations greater than 10 LM (Fig. 3b). We further 
examined the functional ability of the mutated receptors to secrete 
GLP-1 (ZGLP1) from human intestinal NCI-H716 cells, as this cell 
line lacks GPR120 expression and it can secrete GLP-1 in a regulated 
manner’. ALA induced secretion of GLP-1 in NCI-H716 cells expres- 
sing either wild-type (P = 0.004) or p.R67C (P = 3.2 x 10 °) receptor, 
but not in NCI-H716 cells expressing p.R270H mutant (P = 0.96) 
(Fig. 3c). The transfection efficiencies for the GPR120 variant receptors 
were confirmed to be almost the same (data not shown). To examine 
the effect of the p.R270H variant on the wild-type receptor signalling, 
we analysed the [Ca”*]; dose-response curves after the transfection of 
an empty vector, a wild-type receptor plasmid or a p.R270H-mutated 
plasmid into T-REx 293 cells expressing wild-type GPR120. The trans- 
fection of the p.R270H-mutated plasmid suppressed dose-response 
curves, and the maximal ALA-induced [Ca?*]; response was signifi- 
cantly decreased (P = 0.004; Fig. 3d). 

To assess the effect more quantitatively, we analysed [Ca**]; dose- 
response curves in T-REx 293 cells stably expressing bicistronic 


wild-type/wild-type, wild-type/p.R270H or p.R270H/wild-type receptors 
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(Fig. 3e, top). Almost equal levels of receptor protein expression in each 
cell line were confirmed by flow cytometry analysis (Fig. 3e, bottom 
left). Compared with cells expressing wild-type/wild-type receptor, 
the [Ca”*], dose-response curves obtained in cells expressing either 
wild-type/p.R270H or p.R270H/wild-type receptor were markedly 
suppressed, and the maximal ALA-induced [Ca”" ]; response was sig- 
nificantly decreased (P = 1.2 X 10°; Fig. 3e, bottom right). These 
findings suggest that the p.R270H variant that is significantly asso- 
ciated with obesity has an inhibitory effect on GPR120. The 
p-R270H mutant lacks the ability to transduce the signal of long-chain 
free fatty acids, contrary to the p.R67C mutant, which did not associate 
with obesity. 

To analyse whether being a p.R270H variant carrier may affect 
GPR120 expression in the adipose tissue, we quantified GPR120 expres- 
sion in samples from both obese p.R270H carriers and obese non- 
carriers. Two hundred and thirty-eight obese normoglycaemic patients 
from the Atlas Biologique de l’Obésité Sévere cohort had already been 
genotyped for the p.R270H variant. Ten subjects heterozygous for the 
p-R270H variant were matched for age, gender and body mass index 
with ten non-carrier (wild-type) obese normoglycaemic patients (Sup- 
plementary Table 7). The expression of GPR120 was similar between 
p-R270H carriers and wild-type subjects, both in subcutaneous and 
omental adipose tissues (Supplementary Fig. 7a), suggesting that the 
presence of the functionally deleterious mutation has no primary or 
secondary effect on gene expression in fat depots. The adipogenesis 
marker PPARG, the lipogenesis-related factor SCD and the macrophage 
marker CD68 were found similarly well expressed in the adipose tissues 
of wild-type and p.R270H carrier patients (Supplementary Fig. 7b, c). 
Nevertheless, the expression of the fatty-acid-binding protein FABP4 
in omental adipose tissue was significantly lower in p.R270H carriers 
than in wild-type individuals (28% decrease, P = 0.043; Supplementary 
Fig. 7b). 

Our results provide convincing evidence that the lipid sensor GPR120 
is involved in obesity in both mice and humans. Given the role of 
GPR120 as a physiologic integrator of the environment (especially the 
fatty diet), these data provide insight into the molecular mechanisms by 
which the “Westernized’ diet may contribute to early-onset obesity and 
associated complications including non-alcoholic steatohepatitis. It also 
brings some understanding of the metabolic effects of the omega-3 fatty 
acids, which are often proposed as food supplements. This may open 
novel avenues of research for drug development in the treatment of 
obesity, lipid metabolism abnormalities and liver diseases, because 
receptors that sense free fatty acids represent attractive drug targets. 


METHODS SUMMARY 


GPR120-deficient mice were generated by deleting Gpr120 exon 1. All animal 
procedures and euthanasia were reviewed by the local animal care committee 
approved by local government authorities. Blood analysis, extraction and detec- 
tion of mRNA and proteins, and immunohistochemical analysis, were performed 
following standard protocols as described previously*****. Details of antibodies, 
primers and probes are given in Methods. The level of significance for the differ- 
ence between data sets was assessed using Student’s t-test. Analysis of variance 
followed by Tukey’s test was used for multiple comparisons. 
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In human, GPR120 expression in liver and in both omental and subcutaneous 
adipose tissues was assessed by quantitative RT-PCR (Taqman), in lean and obese 
subjects from the Atlas Biologique de l’Obésité Sévere cohort. The four GPR120 
exons were sequenced in 312 French, extremely obese subjects following a standard 
Sanger protocol. The two identified non-synonymous variants (p.R270H and 
p-R67C/rs6186610) were subsequently genotyped in a large European obesity 
case-control study (“eases = 6,942, Mcontrols = 7,654), by high-resolution melting 
analysis and TaqMan, respectively. The association between obesity status and 
each variant was assessed using logistic regression adjusted first for age and gender 
and then for age, gender and geography origin, under an additive model. The 
consequences of the two identified non-synonymous variants for GPR120 func- 
tion ([Ca?*]; response and GLP-1 secretion) were assessed in vitro. The human 
study protocol was approved by the local ethics committee, and participants from 
all of the studies signed an informed consent form. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 


Generation and genotyping of GPR120-deficient mice. GPR120-deficient mice 
on a mixed C57B1/6/129 background were generated by homologous recombina- 
tion. Exon 1 of the Gpr120 gene was replaced with a PGK-neo cassette (Supplemen- 
tary Fig. 1). 

Animals. Mice were housed under a 12-hr light-dark cycle and given regular 
chow, MEF (Oriental Yeast Co.). For HFD studies, 5-week-old male mice were 
placed ona 58Y1 diet (PMI Nutrition International) for a total period of 11 weeks. 
The methods used for animal care and experimental procedures were approved by 
the Animal Care Committee of Kyoto University. 

Indirect calorimetry. Twenty-four-hour energy expenditure and respiratory 
quotient (RQ) were measured by indirect calorimetry, using an open-circuit 
calorimeter system (MK-5000RQ, Muromachi Kikai Co.). Respiratory quotient 
is the ratio of carbon dioxide production to oxygen consumption (VO3). Energy 
expenditure was calculated as the product of the calorific value of oxygen 
(3.815 + 1.232RQ) and VO ;. Locomotor activity was measured by using an 
infrared-ray passive sensor system (Supermex, Muromachi Kikai Co.). 
Histology and immunohistochemistry. Epididymal adipose and pancreatic tissues 
were fixed in 10% neutral-buffered formalin, embedded in paraffin, and sectioned at 
5 um. H&E staining was performed using standard techniques. To measure the 
diameter of adipocytes and the area of pancreatic islets, the diameters of 100 cells 
from five sections from each group were measured using NIH IMAGE software. More 
than 10 fields were examined, islet area was traced and total islet area was calculated 
and expressed as the average score. Liver tissues were embedded in OCT compound 
(Sakura Finetech) and snap-frozen in liquid nitrogen. Tissue sections were stained 
with Oil Red O (Sigma-Aldrich) for lipid deposition using standard methods. 
Triglyceride content assay. To determine the triglyceride content of liver, tissue 
was homogenized with 1/2.5/1.25 (vol/vol) 0.5M acetic acid/methanol/ 
chloroform. The mixture was shaken and 1.25 volumes of chloroform added. 
The mixture was shaken overnight and then 1.25 volumes of 0.5 M acetic acid 
added. After centrifugation at 1,500g for 10 min, the organic layer was collected, 
dried and resuspended in 100% isopropyl alcohol. Measurements were conducted 
using Triglyceride E-test Wako (Wako). 

Glucose tolerance and insulin tolerance tests. Glucose tolerance assays were 
performed on 24-hr-fasted mice. After baseline glucose values were individually 
established using One Touch Ultra (LifeScan), each mouse was given an intra- 
peritoneal injection of 1.5 mg glucose per gram of body weight. Insulin tolerance 
was conducted using the same glucometer. After baseline glucose values were 
established, mice were given human insulin (0.75mU g intraperitoneal; 
Sigma-Aldrich). Clearance of plasma glucose was subsequently monitored at 15, 
30, 60, 90 and 120 min post-injection. 

Immunoblot analysis. For insulin stimulation, 5 U insulin (Sigma-Aldrich) was 
injected through the inferior vena cava. Five minutes later, samples of liver, skeletal 
muscle or WAT were dissected and immediately frozen in liquid nitrogen. Immuno- 
blot analysis were performed as described previously***”°. Anti-IRS1 (Millipore), 
anti-IRS2 (Millipore), anti-SCD1 (Santa Cruz Biotechnology), anti-IRB anti-AKT 
(Cell Signaling Technology), anti-p-AKT (Cell Signaling Technology) and anti-B- 
actin (Sigma-Aldrich) antibodies were used as the primary antibodies. 

Mouse gene expression analysis. Total RNA was extracted from tissue or cells 
using ISOGEN (Nippon Gene). Quantitative RT-PCR and microarray analysis 
were performed as described previously**”*. Briefly, genome-wide mRNA expres- 
sion profiles were obtained by microarray analysis with the Affymetrix GeneChip 
Mouse 430 2.0 Array, according to the manufacturer’s instructions. We used the 
robust multi-array analysis expression measure that represents the log-transformation 
of intensities (background corrected and normalized) from the GeneChips”. 
Functional associations between differentially expressed genes were analysed using 
Ingenuity Pathways Analysis (version 4.0, Ingenuity Systems). 

Microcomputed tomography scanning. Images were obtained using a micro- 
computed tomography system (SHIMADZU ClairvivoCT) with a high-resolution 
flat-panel detector. The maximum resolution of this modality was less than 40 tum. 
The scanner was assumed to have a cylindrical field of view of 65.3 mm in section 
view and of 300 mm in transaxial view. The X-ray source was biased at 60 keV with 
the anode current set to 160 pA. Computed tomography images were analysed 
with OSIRIX software (http://www.osirix-viewer.com/). 

Fatty acid composition of epididymal WAT, liver and plasma. Esterified and 
non-esterified fatty acid composition was measured by gas chromatography. 
Briefly, to analyse esterified fatty acid, samples of epididymal adipose tissue 
(20-25 mg), liver (25-30 mg) and plasma (100 ul) were snap-frozen in liquid 
nitrogen and homogenized in 4 ml of 0.5 N KOH-methanol. Samples were then 
boiled at 100 °C for 30 min to hydrolyse. Total lipids in each sample homogenate 
were then extracted with hexane, followed by trans-esterification of fatty acids 
using boron trifluoride-methanol at 100 °C for 15 min. Methylated fatty acids were 
then extracted with hexane and analysed using a GC-2010AF gas chromatograph 


(SHIMADZU). For the analysis of non-esterified fatty acid, samples of epididymal 
adipose tissue (10-15 mg), liver (10-15 mg) and plasma (100 pl) were snap- 
frozen in liquid nitrogen and homogenized in a mixture of 1.2 ml water, 3 ml 
methanol and 1.5 ml chloroform. Total lipids in each sample homogenate were 
extracted with a mixture of 1.2 ml of water and 1.2 ml of chloroform, followed by 
silylation of fatty acids using N,O-bis(trimethylsilyl)trifluoroacetamide with 1% 
trimethylchlorosilane at 100 °C for 60 min. Silylated fatty acids were then extracted 
with hexane and analysed using a GC-2010AF gas chromatograph (SHIMADZU). 
Mouse embryonic fibroblast adipogenesis assay. To prepare MEFs, we minced 
13.5-d-post-coital mouse embryos and digested them with trypsin. Cells were 
collected and cultured in modified Eagle’s medium (%-MEM; supplemented with 
10% fetal bovine serum (FBS), 50 Uml ' penicillinand 50 mg ml ' streptomycin). 
We induced confluent MEFs to undergo adipogenic differentiation by incubating 
them first for 2d with 10 pg ml insulin, 250nM dexamethasone and 0.5 mM 
isobutylmethylxanthine (Sigma-Aldrich). We measured cellular triglyceride con- 
tent with Triglyceride E-test Wako (Wako). 

Lipid infusion. Intralipid solution with 2 mM triglycerides:palmitoleate was pre- 
pared using a previously described protocol’. Briefly, lipids were dissolved in a 
solvent containing 5% glycerol and 0.72% phosphocholine in 0.9% saline and 
sonicated repeatedly. Lipids stayed in suspension for one week and had to be 
vortexed well before loading the syringe and tubing to prevent clogging. Before 
lipid infusion, mice were anaesthetized and an indwelling catheter was inserted in 
the left internal jugular vein. After overnight fasting, lipids were infused at a rate of 
500 ml kg! h' for 6h. At the end of the infusion, tissues were collected. 
Statistical analysis of the GPR120-deficient mouse study. The level of signifi- 
cance for the difference between data sets was assessed using Student’s t-test. 
Analysis of variance followed by Tukey’s test was used for multiple comparisons. 
Data were expressed as mean + s.e.m. P< 0.05 was considered to be statistically 
significant. 

Human study population. The study protocol was approved by all local ethics 
committees and informed consent was obtained from each subject before par- 
ticipation in the study, in accordance with the Declaration of Helsinki principles. 
For children younger than 18 years, an oral consent was obtained and parents 
provided written informed consent. All subjects were of European origin. 
Human gene expression analysis. We used liver, subcutaneous and omental 
adipose tissue samples from the Atlas Biologique de l’Obésité Sévere’ (ABOS) 
cohort (ClinicalGov NCT01129297), a cohort studied in the Département de 
Chirurgie Générale et Endocrinienne* (Lille CHRU). Total RNA was extracted 
from the tissues using an RNeasy protect Mini Kit (QIAGEN) and quantified by 
absorbance at 260 nm and 280 nm in a PerkinElmer spectrophotometer. Human 
GPR120, FABP4, PPARG, CD68 and SCD mRNA levels were quantified by reverse 
transcription reaction followed by qRT-PCR. Quantitative assessment of human 
mRNA expression was performed using TaqMan Gene Expression Assays 
(Hs01111664_m1: GPR120 and Hs99999905_m1: GAPDH; Hs00609791_m1: 
FABP4; Hs00234592_m1: PPARG; Hs00154355_m1: CD68; Hs01682761_m1: 
SCD; Applied Biosystems) with an Applied Biosystems 7900HT Fast Real-Time 
PCR System. As an internal control for potential housekeeper reference variability, 
gene transcript levels were normalized to GAPDH reference housekeeper tran- 
script level. The mean of the triplicate cycle thresholds of the target was normalized 
to the mean of triplicate cycle thresholds of the reference internal housekeeper genes 
using the formula 2°74" —CTuret, which yielded a relative target-to-reference tran- 
script concentration value as a fraction of reference transcript. Samples for which 
the cycle threshold was above 35 were excluded from the analysis. 

GPR120 exon sequencing. We sequenced the four GPR120 exons in 312 obese 
patients including 121 French obese adults and 191 French obese children who 
were recruited by the CNRS-UMR8199 unit and the Department of Nutrition of 
Paris Hotel Dieu Hospital. GPR120 is located on human chromosome 10q23.33 
and encodes a 377-amino-acid protein (NCBI NM_181745.3 and NP_859529). 
PCR conditions and primer sequences are available on request. Fragments were 
bidirectionally sequenced using the automated 3730xl DNA Analyzer (Applied 
Biosystems). Electrophoregram reads were assembled and analysed using the 
VARIANT REPORTER software (Applied Biosystems). The locations of the var- 
iants are displayed by base numbers counting from the ATG translation initiation 
codon following the Human Genome Variation Society nomenclature for the 
description of sequence variations. The positions of mutations were indicated 
by reference to the human genome build NCBI36/hg18. 

Genotyping of the p.R270H and p.R67C/rs6186610 variants. We genotyped 
the two non-synonymous variants in 6,942 unrelated obese subjects and in 7,654 
control subjects, all of European descent. Genotyped populations are described in 
Supplementary Table 4. The set of obese subjects included 516 unrelated French 
obese children who were recruited by the CNRS-UMR8199 unit or Toulouse 
Children’s Hospital”’; 332 Italian obese children from Verona*’ or Rome”; 170 
Finnish obese adolescents from the Northern Finland Birth Cohort 1986 
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(NFBC1986); 1,164 unrelated French obese adults from the ABOS cohort’* or 
recruited by the CNRS-UMR8199 unit and the Department of Nutrition of 
Paris Hotel Dieu Hospital’’; 2,514 Belgian obese patients from the outpatient 
obesity clinic at Antwerp University Hospital’’; 1,736 Swiss obese subjects who 
were recruited after gastric surgery in Zurich**; and 510 Greek obese subjects 
recruited in the Hippokration Hospital of Thessaloniki or in the Second 
Department of Internal Medicine of the Hospital of Alexandroupolis**. The set 
of control subjects included 422 Italian lean children from Verona” 4,639 Finnish 
lean adolescents from the NFBC1986 cohort*; 1,976 French lean adults from the 
D.ES.LR (Data from the Epidemiological Study on the Insulin Resistance 
syndrome) prospective study** and from the Haguenau study”; 148 Belgian lean 
subjects from Antwerp Hospital”; and 469 Greek lean individuals recruited in 
medical examination centres in Thessaloniki**. The 1,109 French pedigrees 
selected for obesity were recruited by the CNRS-UMR8199**, and the 780 
German childhood obesity trios were recruited at the Universities of Marburg 
and Essen”. The p.R270H variant was genotyped using the LightCycler 480 
High Resolution Melting (HRM) Master kit (Roche), following the manufacturer’s 
protocol. Genotyping of p.R67C/rs6186610 was performed using a custom TaqMan 
assay according to the manufacturer’s instructions (Applied Biosystems). Allelic 
discrimination was performed using an Applied Biosystems 7900HT Fast Real- 
Time PCR System and SDS 2.3 software. For both variants, genotype success rate 
was at least 95% and no deviation (P > 0.05) from Hardy-Weinberg equilibrium 
was observed in any of the examined populations. 

Phenotyping. The 90th and 97th body mass index (BMI) percentiles were used as 
thresholds for childhood overweight and obesity, respectively, according to the 
recommendations of the European Childhood Obesity Group study in local reference 
populations". Adult subjects were defined as normal (BMI <25kgm ”’), over- 
weight (25 <BMI<30kgm ~) and obese (BMI=30kgm *) according to the 
International Obesity Task Force recommendations. 

In silico analysis of both p.R270H and p.R67C variants. Phylogenetic conser- 
vation of the part of GPR120 containing each non-synonymous variant was 
assessed using the UCSC genome browser (Vertebrate Multiz Alignment & 
Conservation), based on a phylogenetic hidden Markov model, phastCons*. To 
predict the possible effect of both amino-acid substitutions on the structure and 
function of GPR120, we used several programs: the POLYPHEN (polymorphism 
phenotyping) web-based program**; PANTHER” (protein analysis through 
evolutionary relationships); the SIFT (sorting intolerant from tolerant) algo- 
rithm**; the SNAP (screening for non-acceptable polymorphisms) software”; 
and the PMUT web-based program”. 

Plasmid construction. A FLAG-human GPR120/pcDNA5/FRT/TO plasmid was 
constructed by ligating GPR120 complementary DNA into the multicloning site of 
the mammalian expression vector pCDNA5/FRT/TO (Invitrogen) with the amino- 
terminal FLAG tag. The point mutation for constructing the FLAG-human GPR120 
p-R67C/pcDNAS5/FRT/TO and FLAG-GPR120  p.R270H/pcDNA5/FRT/TO 
plasmids was carried out using the following primers: p.R67C (sense: 5'- 
getgctegtgeceteccgacgacgcc-3’; antisense: 5’-ggcgtcgtcggcacgccaccagcacc-3') and 
p-R270H (sense: 5’-agccaccagatccacgtgtcccagcaggac-3'; anti-sense: 5’-gtcctgctggga 
cacgtggatctggtggct-3'). All constructs were confirmed by DNA sequencing. 

Cell lines and cell culture. Flp-In T-REx-293 (T-REx 293) cells (Invitrogen) were 
used to develop inducible and stable cell lines expressing GPR120 (wild type, 
p-R270H or p.R67C). T-REx 293 cells were routinely cultured in Dulbecco’s modified 
Eagle’s medium (DMEM; Sigma) supplemented with 10% FBS, 100 pg ml * Zeocin 
(Invitrogen) and 10 pg ml! blasticidin $ (Funakoshi). T-REx 293 cells were trans- 
fected with FLAG-GPR120 (wild type, p.R270H or p.R67C)/pcDNA5/FRT/TO 
using Lipofectamine reagent (Invitrogen) and selected with DMEM, which had 
been supplemented with 10% FBS, 10pgml ’ blasticidin § and 100pgml * 
hygromycin B (Gibco BRL). GPR120 protein expression was induced by adding 
10 1g ml! of doxycycline hyclate (Dox; Sigma) for 48 h. Human NCI-H716 cells 
were obtained from the American Type Culture Collection (Manassas). Cells were 
grown in suspension in Roswell Park Memorial Institute 1640 medium supple- 
mented with 10% FBS, 1001U ml’ penicillin and 100 pg ml”! streptomycin. 
[Ca?*]; response analysis. Cells were seeded at a density of 2 X 10° cells per well on 
collagen-coated 96-well plates, incubated at 37°C for 21h and then incubated in 
Hanks’ Balanced Salt Solution (HBSS, pH 7.4) containing Calcium Assay Kit 
Component A (Molecular Devices) for 1h at 20°C. ALA used in the fluorometric 
imaging plate reader (FLIPR, Molecular Devices) assay were dissolved in HBSS con- 
taining 1% DMSO and prepared in another set of 96-well plates. These plates were set 
on the FLIPR, and mobilization of [Ca”*]; evoked by agonists was monitored. 
Transfection. One million cells were seeded into a 3.5-cm-diameter dish before 
transfection. NCI-H716 cells were transfected with 5 1g of each plasmid using 
Lipofectamine 2000 (Invitrogen) according to the manufacturer’s protocol. At 
24h post-transfection, transfection of each FLAG-tagged construct was confirmed 
by anti-FLAG FACS analysis. Then the cells were reseeded in 24-well culture plates 
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coated with Matrigel matrix (BD Biosciences) at a density of approximately 
3 X 10° cells per well for the secretion studies. To test the effect of variant receptors 
on the ALA-induced [Ca”*]; response of the wild-type receptor, 2 x 10’ T-REx 
293 cells expressing Dox-inducible FLAG-GPR120 wild type were seeded into a 
15-cm-diameter dish before transfection. Cells were then transfected with 32 ug of 
each plasmid (empty vector, wild type and p.R270H GPR120) using 
Lipofectamine 2000 (Invitrogen) according to the manufacturer’s protocol. At 
24h post-transfection, cells were reseeded at a density of 2 x 10° cells per well 
on collagen-coated 96-well plates and treated with 10 jig ml’ of Dox, and at 48h 
post-transfection, ALA-induced [Ca’*], response was monitored. 

GLP-1 secretion analysis. Cells were serum-starved with FBS-free DMEM for 3 h, 
washed with HBSS and incubated for 2h at 37°C in 0.3 ml FBS-free DMEM 
containing DMSO (negative control), 14M phorbol 12-myristate 13-acetate 
(positive control) or 10014M ALA. Supernatants were collected and the active 
GLP-1 concentration in the supernatant was determined by enzyme immunoassay 
using GLP-1 (Active) ELISA Kit (Millipore). 

Flow cytometry analysis. Anti-FLAG (Sigma) and anti-HA (Roche) antibodies 
were used for staining. Data were acquired and analysed on FACSCalibur with 
CELLQUEST software (Becton Dickinson). 

Statistical analysis of human study. We assessed the effect of both non- 
synonymous variants (p.R270H and p.R67C) on obesity using a logistic regression 
adjusted first for age and gender and then for age, gender and geography, under an 
additive model, using the software R (version 2.12). Adjustment for geography was 
achieved to reflect a north-south gradient between the six different countries of 
origin of the study participants. An ordinal variable was created and coded: 1 for 
Finland, 2 for Belgium, 3 for France and Switzerland, 4 for Italy and 5 for Greece. 
This variable was added as a covariate in the logistic regression model. 

Data analysis for the [Ca*‘]; response was performed using IGOR PRO 
(WaveMetrics). Significant differences between expression among wild-type and 
heterozygous groups, and among lean and obese wild-type subjects, were assessed 
using non-parametric Mann-Whitney analysis (GRAPHPAD PRISM 5 software). 
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The role of Drosophila Piezo in mechanical 


nociception 


Sung Eun Kim', Bertrand Coste!, Abhishek Chadha!, Boaz Cook! & Ardem Patapoutian>? 


Transduction of mechanical stimuli by receptor cells is essential for 
senses such as hearing, touch and pain’ ~*. Ion channels have a role in 
neuronal mechanotransduction in invertebrates'; however, func- 
tional conservation of these ion channels in mammalian mechano- 
transduction is not observed. For example, no mechanoreceptor 
potential C (NOMPC), a member of transient receptor potential 
(TRP) ion channel family, acts as a mechanotransducer in 
Drosophila melanogaster’ and Caenorhabditis elegans*’; however, 
it has no orthologues in mammals. Degenerin/epithelial sodium 
channel (DEG/ENaC) family members are mechanotransducers in 
C. elegans® and potentially in D. melanogaster’; however, a direct 
role of its mammalian homologues in sensing mechanical force has 
not been shown. Recently, Piezol (also known as Fam38a) and 
Piezo2 (also known as Fam38b) were identified as components of 
mechanically activated channels in mammals"’. The Piezo family are 
evolutionarily conserved transmembrane proteins. It is unknown 
whether they function in mechanical sensing in vivo and, if they 
do, which mechanosensory modalities they mediate. Here we study 
the physiological role of the single Piezo member in D. melanogaster 
(Dmpiezo; also known as CG8486). Dmpiezo expression in human 
cells induces mechanically activated currents, similar to its mam- 
malian counterparts". Behavioural responses to noxious mech- 
anical stimuli were severely reduced in Dmpiezo knockout larvae, 
whereas responses to another noxious stimulus or touch were not 
affected. Knocking down Dmpicezo in sensory neurons that mediate 
nociception and express the DEG/ENaC ion channel pickpocket 
(ppk) was sufficient to impair responses to noxious mechanical 
stimuli. Furthermore, expression of Dmpiezo in these same neu- 
rons rescued the phenotype of the constitutive Dmpiezo knockout 
larvae. Accordingly, electrophysiological recordings from ppk- 
positive neurons revealed a Dmpiezo-dependent, mechanically 
activated current. Finally, we found that Dmpiezo and ppk function 
in parallel pathways in ppk-positive cells, and that mechanical 
nociception is abolished in the absence of both channels. These data 
demonstrate the physiological relevance of the Piezo family in 
mechanotransduction in vivo, supporting a role of Piezo proteins 
in mechanosensory nociception. 

D. melanogaster is widely used to study mechanotransduction, and 
genetic studies have identified several ion channels that are essential 
for mechanosensation®*”’*'*. However, none of the identified proteins 
have been shown to be activated by mechanical force when expressed 
in heterologous systems. Because expression of mouse Piezo proteins 
in a variety of mammalian cells induces mechanically activated 
currents’®, we investigated whether the Drosophila counterpart is also 
sufficient to induce mechanosensitivity. Similar to its mammalian 
counterparts, the Dmpiezo gene is predicted to consist ofa large number 
of transmembrane domains (39; Supplementary Fig. 1). Although fly 
and mammalian piezo genes do not exhibit extensive sequence conser- 
vation (24% identity), expression of Dmpiezo in cultured human cells 
induced mechanically activated cationic currents, suggesting a role of 
Dmpiezo in mechanotransduction". 


To characterize Dmpiezo expression in flies we used a fusion between 
the Dmpiezo enhancer/promoter region and GAL4 (DmpiezoP- 
GAL4). Four independent DmpiezoP-GAL4 transgenic insertions were 
examined to avoid insertional effects on GAL4 expression. We used 
green fluorescent protein (GFP) regulated by upstream activating 
sequence elements (UAS) (UAS-GFP) for labelling cells, except for 
arborized neurons that were optimally visualized using the membrane- 
targeted UAS-CD8::GFP. We found fluorescent labelling induced by 
Dmpiezo enhancer/promoter region in all types of sensory neurons 
and several non-neuronal tissues in both adults and larvae (Sup- 
plementary Fig. 2). This diverse pattern of Dmpiezo expression observed 
in Drosophila is in accord with the expression of Piezol and Piezo2 in 
mice”®. 

We created Dmpiezo knockout flies in which all 31 coding exons 
were deleted using genomic recombination’> (Fig. 1a, see Supplemen- 
tary Fig. 3 for details). The knockout flies were viable, fertile and did 
not show a lack of coordination or a defect in bristle mechanoreceptor 
potential (Supplementary Fig. 4). We studied whether Dmpiezo 
knockout larvae have mechanical nociception deficits by using a mech- 
anically induced escape behaviour assay”’*"*. Stimulation with von 
Frey filaments that ranged from 2-60 milliNewton (mN) demon- 
strated that Dmpiezo knockout larvae have a severe response deficit 
over a wide range (Fig. 1b). Repeated stimulations of the same larvae 
resulted in comparable responsiveness in both wild-type and Dmpiezo 
knockout, indicating that the stimuli did not induce considerable 
damage to the sensory system (Fig. 1c, d). A 153 + 11.0 mN filament 
elicited responses only to the first of three stimulations in wild-type 
larvae, arguing that this amount of force is damaging (data not shown). 
For further experiments, we chose to stimulate the larvae using a 
45 mN filament, which has been used in a previous study”, and elicits 
a substantial response in both wild-type and Dmpiezo mutant larvae. 
Thirty four + 4.4% of Dmpiezo knockout larvae showed a response to 
45 mN filament stimulation, compared to over 80% of wild-type or 
heterozygote larvae (Fig. le). As a control for the genetic background, 
we used larvae that carry the Dmpiezo knockout allele on one chro- 
mosome and a deficiency in which the entire Dmpiezo genomic region 
is deleted on the homologous chromosome. The defect in the trans- 
heterozygous larvae was similar to the knockout homozygote pheno- 
type (51 + 3.9%, P = 0.091). In contrast, Dmpiezo knockout larvae 
were indistinguishable from wild type in an assay for responses to high 
temperature, a different noxious stimulus that elicits the same escape 
response” (Fig. 1f). Therefore, Dmpiezo knockout larvae retain a 
normal ability to elicit the escape behaviour in response to noxious 
stimuli, whereas Dmpiezo is specifically required for the mechanical 
modality of nociception. To evaluate the possible role of Dmpiezo in 
other modes of larval mechanical sensing, we tested the sensitivity of 
Dmpiezo knockout to gentle touch, which is mediated through ciliated 
neurons’”'*. We observed no defect in the sensitivity of Dmpiezo 
knockout larvae to innocuous gentle touch (Fig. 1g). 

A mechanical nociception phenotype was previously observed in 
mutants of ppk, a DEG/ENaC channel’ and painless (pain), a TRPA 
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Figure 1 | Mechanical nociception defect in Dmpiezo knockout larvae. 

a, Genomic map showing wild-type Dmpiezo gene (top) and engineered 
Dmpiezo knockout (bottom). Yellow and black boxes represent coding and 
non-coding exons, respectively. The segment deleted from the left arm of 
chromosome 2 (2L) in Dmpiezo knockout is marked with a grey box. 

b, Mechanical nociception assay using a range of stimulus forces in wild-type 
(WT) and Dmpiezo knockout larvae. n = 40 from four independent 
experiments. *P < 0.05, **P < 0.01 from two-tailed paired Student t-test. 


ion channel’*. The specificity of Dmpiezo knockout to mechanical 
nociception resembles the phenotype of ppk, as pain is also essential 
for sensing thermal nociception’*. We therefore tested the role of 
Dmpiezo in ppk-positive cells using ppk-GAL4, which labels subclasses 
of multidendritic neurons’’”®. The multidendritic neurons are non- 
ciliated receptor cells that tile the body wall of the larvae and respond 


ppk-EGFP 
Dmpiezo>DsRed 


_100 + _ 

& 1 & 

@ 804 ° 

$ 60- $s 

0) so] 

2 404 a. UE ao 

2 on 2 a] 

S 20} S 20- 

8 04 g of \ Ar a 

b ps ps ah a8 oat? 48 
py i> 5 el apre® 4629 oe 61? one’ 
ot? eh a on Ko osteo oetow™ 
go _ott? e icy _& 

Nis or wr eK 


2 | NATURE | VOL 000 | 00 MONTH 2012 


Kernan score 


c, d, Mechanical nociception assay using repeated stimuli of the same larvae. 
n = 40. KO, knockout. e, Mechanical nociception assay using a 45 mN von Frey 
filament with wild type (+/+), heterozygous knockout (+/—), heterozygous 
deficiency (Def/+), homozygous KO (—/—) and trans-heterozygous KO 
(Def/—). n> 85. ***P < 0.001. f, Thermal nociception assay using heated 
probe (45 °C). n = 60. g, Gentle touch assay’’. For details about the Kernan 
score, see Methods. n > 150. Error bars indicate mean = s.e.m. NS, 

not significant. 


to a variety of external stimuli such as mechanical forces, temperature 
and light”'*'°?". We used enhanced (E)GFP driven directly by the reg- 
ulatory regions of the ppk gene (ppk-EGFP)” together with a red fluor- 
escent protein expression in Dmpiezo-positive cells to probe Dmpiezo 
and ppk co-expression. Indeed, we did observe that all ppk-positive cells 
also expressed Dmpiezo (Fig. 2a). Next we used ppk-GALA4 to drive the 
expression of Dmpiezo RNA interference (RNAi) to test whether 
Dmpiezo function is specifically required in ppk-expressing cells. The 
restricted knockdown of Dmpiezo resulted in a mechanical nociceptive 
phenotype (Fig. 2b) similar to the phenotype observed in Dmpiezo 
knockout larvae (Fig. le). In a complementary approach, we used 
ppk-GAL4-driven expression of Dmpiezo complementary DNA in an 
attempt to rescue the mechanical nociception phenotype of Dmpiezo 
knockout larvae. We used a fusion between DmPiezo and GFP to 
monitor expression levels in ppk cells and DmPiezo localization within 
the neurons. GFP-DmPiezo fusion protein induces mechanically 
activated currents in human cell lines, similar to untagged DmPiezo, 


Figure 2 | Dmpiezo functions in ppk-positive type II sensory neurons. 

a, Double fluorescence labelling using ppk-EGFP (green) and DmpiezoP-GAL4 
that drives the expression of the nucleus targeted UAS-DsRed-NLS (red). A 
representative high-magnification image shows one ppk-positive neuron 
(arrow). All three ppk-positive cells in each hemisegment expressed Dmpiezo in 
all segments. b, Mechanical nociception assay with Dmpiezo knockdown larvae 
in ppk-expressing cells by ppk-GAL4 and UAS-Dmpiezo-RNAi. n > 85, 

*** P< (),001. c, Mechanical nociception assay in rescued Dmpiezo knockout. 
GFP-DmPiezo was expressed in ppk-cells using ppk-GAL4 and UAS-GFP- 
DmPiezo. n> 60. **P<0.01. Error bars indicate mean + s.e.m. 
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Figure 3 | Dmpiezo mediates mechanically activated currents in ppk- 
positive neurons. a, Representative currents elicited by negative pipette 
pressure (0 to —60 mm Hg, A10 mm Hg) in cell-attached configuration at 
—80 mV in wild type (left) and Dmpiezo ‘~ (right). b, Average peak current- 
pressure relationship of stretch-activated currents in wild type (n = 12 cells) 
and Dmpiezo ‘~ (n= 13 cells). Data points are mean + s.e.m. fitted with a 
Boltzmann equation. **P < 0.01, ***P < 0.001, Mann-Whitney test. 


confirming functionality (Supplementary Fig. 5a—c). When expressed 
in Drosophila, GFP-DmPiezo fluorescence was present throughout cell 
bodies, axons and dendritic arborizations of ppk-positive neurons 
(Supplementary Fig. 5d). Importantly, expression of GFP-DmPiezo 
in ppk-positive neurons alone was sufficient to rescue the mechanical 
nociception defect of Dmpiezo knockout larvae (Fig. 2c). These data 
suggest that Dmpiezo functions in ppk-positive neurons to mediate 
mechanical nociception. 

To test if the ppk-positive neurons respond to mechanical stimuli 
and if Dmpiezo mediates such responses, we performed electrophysio- 
logical recordings from isolated cells. Larvae that had GFP labelling in 
ppk-positive neurons were dissociated using enzymatic digestion and 
mechanical trituration. Plated fluorescent neurons were then tested 
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using patch-clamp recordings in the cell-attached configuration, and 
they were stimulated using negative pressure through the recording 
pipette’®. Stimulating wild-type neurons resulted in a current that was 
rapidly activated and had a half-maximal activation (P59) of 
27.6 + 7.6mm Hg (Fig. 3). These currents were not observed in the 
Dmpiezo knockout mutant neurons (Fig. 3). Therefore, ppk-positive 
neurons, which mediate the avoidance response to noxious stimuli, 
display Dmpiezo-dependent, mechanically activated currents. 

Silencing of ppk cells resulted in complete abolition of noxious 
mechanosensation (Supplementary Fig. 6), in accord with the severe 
defect previously observed"*. In contrast, only a moderate deficit is 
observed upon eliminating or knocking down ppk in the same cells’, 
suggesting that there are multiple pathways for mechanical sensing. 
We tested mechanical nociception in larvae that are deficient in 
Dmpiezo and either pain or ppk to gain insight into cellular pathways 
that involve mechanotransduction in these cells. Once again, we used a 
45 mN filament, enabling us to monitor both Dmpiezo-dependent and 
independent mechanisms (Fig. 1b). The Dmpiezo::pain double mutant 
had a defect that was comparable to each one of the mutants separately, 
suggesting that Dmpiezo and pain might function in the same pathway 
(Fig. 4a). Larvae that are heterozygous for both Dmpiezo and pain 
showed a response deficit whereas each one of them separately 
was normal (Fig. 4b), further demonstrating their role in a common 
signalling mechanism. Remarkably, combining both Dmpiezo and 
ppk knockdowns resulted in a nearly complete abolishment of res- 
ponses to noxious mechanical stimuli (Fig. 4c). Importantly, responses 
to noxious temperatures and touch were normal in larvae with both 
Dmpiezo and ppk knocked down (Fig. 4d, e). These data indicate that 
Dmpiezo and ppk function in two parallel pathways in ppk-positive 
sensory neurons, and that together they constitute the response to 
noxious mechanical stimuli. There could be many reasons why the 
mechanically activated currents we observe are entirely dependent on 
DmPiezo (Fig. 3). This could either be because PPK responds to a 
different modality of mechanical stimulus or due to the specific experi- 
mental settings (for example, level of applied forces, solutions, applied 
voltage). Future experiments should resolve this issue. 
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Figure 4 | Dmpiezo and ppk function in parallel pathways. a, Mechanical 
nociception assay using a 45 mN von Frey filament with double-null mutant of 
Dmpiezo and painless. Single-knockout strains were used as controls and the 
wild-type strain is w'!"®. n > 60. b, Mechanical nociception assay on 
heterozygous larvae for Dmpiezo and/or pain. n (heterozygote Dmpiezo 
knockout) = 74 from three trials, n (heterozygote painless') = 169 from five 
trials, n (trans-heterozygote) = 166 from five trials. c, Mechanical nociception 
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assay with ppk and Dmpiezo knockdown. ppk and/or Dmpiezo RNAi were 
driven by ppk-GAL4. n> 90. *P<0.5, ***P < 0.001. d, Gentle touch 
sensitivity assay with ppk and Dmpiezo knockdown. For details about the 
Kernan score, see Methods. Wild type is w!!!8_ 4 > 90. e, Thermal nociception 
assay using 45 °C probe with ppk and Dmpiezo knockdown. n > 75. Error bars 
indicate mean + s.e.m. 


00 MONTH 2012 | VOL 000 | NATURE | 3 


©2012 Macmillan Publishers Limited. All rights reserved 


LETTER 


Using the Drosophila model system we have demonstrated that 
piezo is essential for sensing noxious mechanical stimulus in vivo. 
This is the first demonstration that a Piezo family member is essential 
for mechanotransduction in the whole animal. Indeed, Dmpiezo is, to 
our knowledge, the first eukaryotic excitatory channel component 
shown to be activated by mechanical force in a heterologous expres- 
sion system and required for sensory mechanotransduction in vivo. 
Piezo2 is expressed in mouse dorsal root ganglion neurons that are 
involved in sensing nociception, and is required for rapidly adapting 
mechanically activated currents in such isolated neurons’”. This study 
raises the possibility that mammalian Piezo2 is also required for mech- 
anical pain transduction in vivo. Furthermore, Drosophila genetics can 
now be used to map cellular pathways involved in piezo-dependent 
mechanotransduction in sensory neurons and beyond. 


METHODS SUMMARY 

Fly stocks. PiggyBacs (PBac{WH}CG8486-f02291, PBac{RB}CG8486-e00109; 
Exelixis Collection at the Harvard Medical School), ppk-GAL4 (Bloomington 
Drosophila Stock Center (BDSC), 32078, 32079), Deficiency (Df(2L)Exel7034/ 
CyO; BDSC, 7807), UAS-Dmpiezo-RNAi (National Institute of Genetics, Japan, 
8486R-3), UAS-ppk-RNAi (Vienna Drosophila RNAi Center, 108683), ppk- 
EGFP5 (ref. 22; Y. N. Jan), painless' (BDSC, 27895). 

Generating Dmpiezo knockout flies. The Dmpiezo knockout fly was generated 
by FLP-FRT recombination with two PiggyBac lines as described previously’. The 
recombined knockout fly was confirmed by PCR (Supplementary Fig. 3). The 
genetic background was cleaned using meiotic recombination with w’!”*. 
Imaging. Fluorescence in adult fly or larva was detected by Nikon C2 Confocal 
Laser Point Scanning Microscope, Olympus FluoView500 Confocal Microscope 
or Olympus AX70 microscope. 

Cloning. To clone the enhancer/promoter of the Dmpiezo gene, the genomic 
region between 1.0 kb upstream of the beginning of transcription and the start 
codon of Dmpiezo was amplified by PCR and cloned into the pPTGAL vector. The 
GFP-DmPiezo construct has three alanines as a linker between the carboxy- 
terminal GFP and amino-terminal DmPiezo. The construct was cloned in 
modified pUAST vector to generate transgenic flies and in modified pIRES2- 
EGFP vector for electrophysiology recordings. 

Behavioural assays and statistics. The mechanical nociception was tested as 
described previously*'*”* using calibrated von Frey filaments. The thermal noci- 
ception was tested as described previously'* using a 45 °C heated metal probe. All 
error bars represent mean + s.e.m. 

Isolation of ppk-positive neurons. Third instar larvae that had GFP labelling in 
ppk-positive neurons were dissected, digested with collagenase and mechanically 
triturated. The cells were collected by centrifugation and plated on a poly-p-lysine- 
coated glass coverslip. The fluorescent ppk-positive cells were recorded after 
incubating for 2h at room temperature (23-25 °C). 

Electrophysiology. HEK cells were studied in the whole cell configuration using a 
polished glass probe for stimulation’? and ppk-positive neurons were stimulated 
using negative pressure in the cell attached configuration”. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 

Fly stocks. We used the following stocks: PiggyBacs (PBac{WH}CG8486-f02291, 
PBac{RB}CG8486-e00109, Exelixis Collection at the Harvard Medical School), 
ppk-GAL4 (Bloomington Drosophila Stock Center (BDSC), 32078, 32079), 
Deficiency (Df(2L)Exel7034/CyO, BDSC, 7807), UAS-Dmpiezo-RNAi (National 
Institute of Genetics, Japan, 8486R-3), UAS-ppk-RNAi (Vienna Drosophila RNAi 
Center, 108683), ppk-EGFP5 (ref. 22; Y. N. Jan), painless’ (BDSC, 27895) and 
UAS-DsRed-NLS (J. W. Posakony). The following stocks were from BDSC: 
UAS-GEP, UAS-CD8::GFP, CyO-GEP, w’1’* and Canton-S. 

Engineering Dmpiezo knockout flies. The Dmpiezo knockout fly was generated 
as described in previously described’’. Two PiggyBac lines that carry the FRT 
sequence were selected for FLP-FRT recombination. PBac{ WH}CG8486-f02291 
is inserted in the first intron and PBac{RB}CG8486-e00109 in the 3’ untranslated 
region (UTR) of the Dmpiezo gene. After FLP-FRT recombination, 20 kb of the 
Dmpiezo gene, including all 31 coding exons, was removed and replaced with 7 kb 
of PiggyBac insertion that contained the FRT sequence and white gene. The 
recombined knockout fly was confirmed by PCR reactions (Supplementary Fig. 2). 
The genetic background was cleaned using meiotic recombination with w’1”*. 
Molecular biology. To clone the enhancer/promoter of the Dmpiezo gene, the 
genomic region between 1.0kb upstream of the beginning of transcription and 
the start of the Dmpiezo coding region was amplified by PCR using forward 
primer, 5’-ATCTGGCGGCCGCTATCTATTTTTTAACTAGTGGAAGTCT-3' 
and reverse primer, 5’-TTACTGGTACCATGGATGCCTCCGGCGCCGTTC 
TCCTCCAG-3’. The amplified sequence was cloned into pPTGAL vector 
(Drosophila Genomic Resource Center, 1225) using NotI and KpnI sites and the 
sequence was verified. 

For rescue experiments, Dmpiezo cDNA was amplified from the plasmid 
reported in ref. 11, using forward primer 5'-TATTAGCGGCCGCAGTCTTCA 
GCTATGCGTGCATGGTG-3’ and reverse primer 5'-TAATTCGGTCCGTTAT 
TGCGGTTGCTGTGGCTGCAGTTGCTCCGG-3’ and cloned into a modified 
pUAST vector using NotI and RsrIl. NotI restriction enzyme site was used as a 
linker by providing three alanine residues between EGFP and DmPiezo. The order 
of sequences in the pUAST vector is the following: UAS-kozak-EGFP-3 x (Ala)- 
DmPiezo. To generate transgenic flies, DNA was injected into the isogenized w'"* 
embryos along with transposase A2-3. For the electrophysiology experiment, EGFP— 
DmPiezo was cloned into mammalian expression vector with CMV promoter. 
Behaviour assays. Mechanical nociception was tested as described previously*"*'° 
using calibrated von Frey filaments. Thermal nociception was tested as described 
previously’ using a calibrated heated metal probe. For both nociception assays, 
the number of larvae that showed at least one 360° rotation was counted for each 
trial. The gentle touch assay was performed and each stimulated larva was scored 
as described previously’’. 0 = no response, 1 = hesitates, 2 = turns or withdraws 
anterior segments, 3 = single reverse contractile wave, and 4= multiple waves. 
For all behaviour assays each third instar larva was stimulated only once. All data 
were generated from at least three trials. 

The von Frey filaments for larvae behaviour experiments were modified from 
Touch-Test sensory Evaluator (North Coast Medical) or from monofilament 
fishing lines. Each monofilament was cut to a length of 18 mm, glued into a pipette 
tip so that 9 mm of it protruded and mounted on a hand manipulator with a 90° 
angle. Each von Frey filament was calibrated as described previously’. The force of 
each von Frey stimulator was determined by measuring the weight upon filament 
bending and converting the value into the force: force (mN) = mass (g) X gravity 
acceleration constant (g; 9.8). Each stimulator was calibrated 15 times and its mean 
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value was used in figures. The calibrated forces (mean + s.e.m.) of each stimulator 
are as follows (in mN): 1.78 + 0.15, 2.59 + 0.15, 5.04+0.19, 11.2 + 0.66 and 
59:9 1,79, 

Fluorescence imaging. For identifying tissues or cells expressing fluorescence by 
the Dmpiezo promoter, both adult flies and third instar larvae carrying DmpiezoP- 
GAL4 and UAS-GEP, or UAS-CD8::GFP, were dissected or whole-mounted. For 
double fluorescent labelling in multidendritic neurons, second instar larvae carrying 
ppk-EGFP, DmpiezoP-GAL4 and UAS-DsRed were whole-mounted. For imaging 
ppk-cells expressing GFP-DmPiezo, third instar larvae carrying ppk-GAL4 and 
UAS-GFP-DmPiezo were whole-mounted. Fluorescence images were obtained 
either by Nikon C2 Confocal Laser Point Scanning Microscope, Olympus 
FluoView500 Confocal Microscope or Olympus AX70 microscope. 

Isolation of larvae ppk-positive neurons. In both wild-type and Dmpiezo 
knockout larvae, ppk-positive neurons were fluorescently labelled by ppk-EGFP, 
which is a direct fusion of ppk genomic regulatory regions with EGFP. Third instar 
larvae were dissected in M3 media containing 10% heat inactivated FBS. Each larva 
was cut twice and its internal organs were removed. The cleaned body wall was 
treated with 5mgml ' collagenase type IV at 25°C for 1h in serum-free M3 
media and washed with serum containing M3 media. The enzyme-treated body 
wall was triturated with fire-polished Pasteur pipettes in M3 media with 2mM 
EGTA and 10% FBS. The cuticle and debris were removed by centrifugation at 40g 
and the small size cells including neurons were collected by centrifugation at 360g 
for 10 min. The cell pellet was resuspended with serum containing M3 media and 
plated into a poly-p-lysine-coated coverslip in a small droplet. After 2 h of incuba- 
tion at room temperature (23-25 °C), the coverslips were transferred to the elec- 
trophysiology rig for recording. 

Electrophysiology. For whole-cell recordings in HEK293T cells, patch pipettes 
had resistances of 2-3 MQ when filled with an internal solution consisting of (in 
mM) 133 CsCl, 10 HEPES, 5 EGTA, 1 CaCl, 1 MgCl, 4 MgATP and 0.4 Na2GTP 
(pH adjusted to 7.3 with CsOH). The extracellular solution consisted of (in mM) 
130 NaCl, 3 KCI, 1 MgCl, 10 HEPES, 2.5 CaCl, 10 glucose (pH adjusted to 7.3 
with NaOH). Mechanical stimulation was achieved using a fire-polished glass 
pipette (tip diameter 3-4 jum). The probe had a velocity of 1umms ' during 
the ramp segment of the command for forward motion and the stimulus was 
applied for 150 ms. 

For cell-attached recordings in ppk-positive dissociated neurons, patch pipettes 
had resistances of 3-3.5 MQ when filled with a solution consisting of (in mM) 130 
NaCl, 5 KCl, 10 HEPES, 1 CaCl, 1 MgCl, 10 TEA-Cl (pH 7.3 with NaOH). 
External solution used to zero the membrane potential consisted of (in mM) 
140 KCl, 10 HEPES, 1 MgCly, 10 glucose (pH 7.3 with KOH). Membrane patches 
were stimulated with brief negative pressure pulses through the recording elec- 
trode using a Clampex controlled pressure clamp HSPC-1 device (ALA-scientific). 
Stretch-activated channels were recorded at a holding potential of -80 mV with 
pressure steps from 0 to —60mmHg (—10mmHg increments). Current- 
pressure relationships were fitted with a Boltzmann equation of the form: 
I(P) = (1 + exp (—(P- Pso)/s))—1, where I is the peak of stretch-activated current 
at a given pressure, P is the applied patch pressure (in mm Hg), Psp is the pressure 
value that evoked a current value which is 50% of Imax, and s reflects the current 
sensitivity to pressure. 

All experiments were performed at room temperature. Currents were sampled 
at 50 or 20 kHz and filtered at 5 or 2 kHz. Voltages were not corrected for a liquid 
junction potential. Leak currents before mechanical stimulations were subtracted 
off-line from the current traces. 
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Piezo proteins are pore-forming subunits 
of mechanically activated channels 
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Mechanotransduction has an important role in physiology. Biological processes including sensing touch and sound 
waves require as-yet-unidentified cation channels that detect pressure. Mouse Piezol (MmPiezol) and MmPiezo2 
(also called Fam38a and Fam38b, respectively) induce mechanically activated cationic currents in cells; however, it is 
unknown whether Piezo proteins are pore-forming ion channels or modulate ion channels. Here we show that 
Drosophila melanogaster Piezo (DmPiezo, also called CG8486) also induces mechanically activated currents in cells, 
but through channels with remarkably distinct pore properties including sensitivity to the pore blocker ruthenium red 
and single channel conductances. MmPiezol assembles as a ~1.2-million-dalton homo-oligomer, with no evidence of 
other proteins in this complex. Purified MmPiezol reconstituted into asymmetric lipid bilayers and liposomes forms 
ruthenium-red-sensitive ion channels. These data demonstrate that Piezo proteins are an evolutionarily conserved ion 


channel family involved in mechanotransduction. 


Mechanically activated currents have been described in various 
mammalian cells, including inner ear hair cells’, somatosensory 
dorsal root ganglion neurons’, vascular smooth muscle cells* and 
kidney primary epithelia’. Most of these mechanically activated 
currents are cationic with Ca** permeability, leading to a search for 
cation channels able to convert mechanical forces into such currents. 
Few mechanically activated channels have been described so far>’; 
however, none of the candidates has been shown convincingly to 
mediate the physiologically relevant non-selective cationic mechanically 
activated currents in mammals. 

MmPiezol was recently identified as a protein required for mech- 
anically activated currents in a mammalian cell line. Expressing 
MmPiezol or related MmPiezo2 in a variety of mammalian cell lines 
induces mechanically activated cationic currents*. MmPiezol- 
induced currents are inhibited by GSMTx4 (Grammostola spatulata 
mechanotoxin 4), a peptide widely used to study mechanically activated 
channels’. MmPiezol and MmPiezo2 contain over 30 putative trans- 
membrane domains and do not resemble known ion channels or other 
protein classes. Piezo proteins could be non-conducting subunits of 
cationic ion channels required for proper expression or for modulating 
channel properties®’®"’. Alternatively, Piezo proteins may define a 
novel class of ion channels involved in mechanotransduction. 


Mechanosensitivity of DmPiezo 


Piezo sequences are present in the genomes of many animal, plant and 
other eukaryotic species. Functional analysis of Piezo proteins from 
phylogenetically distant species could demonstrate a conserved role of 
these proteins in mechanotransduction; furthermore, a comparative 
analysis of mechanically activated currents could elucidate unique pore 
properties of channels induced by Piezo proteins from distinct species. 
We focused on the apparently single member of D. melanogaster 
Piezo (DmPiezo), as this invertebrate species is widely used to study 
mechanotransduction using genetic approaches’*-’*. DmPiezo is 24% 
identical to mammalian Piezo proteins, with sequence conservation 


throughout the length of the proteins (Supplementary Fig. 1). We 
cloned the full-length DmPiezo complementary DNA into pIRES2- 
EGFP vector. We recorded mechanically activated currents from fluor- 
escent HEK293T cells expressing DmPiezo-pIRES2-EGFP by applying 
force to the cell surface while monitoring transmembrane currents 
at constant voltage using patch-clamp recordings in the whole-cell 
configuration”’”"*. DmPiezo, but not mock-transfected cells, showed 
large mechanically activated currents (Fig. 1a, b). These currents have a 
time constant of inactivation t of 6.2£0.3ms (n= 32 cells) at 
—80 mV when fitted with mono-exponential function, which is faster 
than observed for MmPiezol (~16ms) and more comparable to 
MmPiezo2 (~7ms)°. Similar to its mammalian counterparts, 
DmPiezo mechanically activated currents are characterized by a 
linear current-voltage (I-V) relationship with a reversal potential 
around 0mV, consistent with it mediating a non-selective cationic 
conductance (Fig. 1c). We further characterized DmPiezo-induced cur- 
rents in HEK293T cells in response to negative pressure pulses applied 
through the recording pipette in the cell-attached mode, an alternative 
mechanosensitivity assay. Overexpression of DmPiezo induced stretch- 
activated currents (Fig. 1d, e) with a pressure for half-maximal 
activation (Pso) of —31.8 + 2.8mm Hg (Fig. 1f), similar to the Pso 
calculated for MmPiezol-induced currents (~30 mm Hg)*. 
Therefore, mechanosensitivity of the Piezo family is conserved in 
invertebrates. We demonstrate the physiological relevance of DmPiezo 
in vivo in an accompanying paper”. 


Pore properties of Piezo proteins 

We next compared fundamental permeation properties of MmPiezol 
and DmPiezo. Ruthenium red, a polycationic pore blocker of TRP 
channels”', blocks MmPiezol- and MmPiezo2-induced mechanically 
activated currents*. We found that ruthenium red is a voltage-dependent 
blocker of MmPiezol1, with an ICs9 value of 5.4 + 0.9 uM at —80 mV 
(Fig. 2a—c): at a concentration of 30 1M, extracellular ruthenium red 
inhibited inward mechanically activated currents without affecting 
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Figure 1 | Human cells expressing Drosophila Piezo (DmPiezo) show large 
mechanically activated currents. a-f, Mechanically activated currents of 
DmPiezo-expressing HE293T cells recorded in the whole-cell (a-c) or cell- 
attached (d-f) configuration. a, Representative traces of mechanically activated 
inward currents at -80 mV in DmPiezo-transfected cells subjected to a series 
of mechanical steps in 1 [1m increments. b, Average maximal current amplitude 
of mechanically activated inward currents at —80 mV. c, Representative -V 
relationship of mechanically activated currents in DmPiezo-transfected cells. 
The inset shows mechanically activated currents evoked at holding potentials 
ranging from —80 to +80 mV. d, Representative currents elicited by negative 
pipette pressure (0 to -60 mm Hg, A20 mm Hg) in DmPiezo-transfected cells. 
e, Average maximal current amplitude of stretch-activated currents at —80 mV. 
f, Imax normalized current-pressure relationship of stretch-activated currents 
recorded at —80 mV in DmPiezo-transfected cells (n = 8 cells) and fitted witha 
Boltzmann equation. Ps9 is the average of P59 values determined for individual 
cells. Bars represent mean + s.e.m. and the number of cells tested is shown 
above bars. ***P < 0.001, Mann-Whitney U-test. 


outwards currents. Such voltage dependence is a characteristic of open 
channel block. A high concentration of ruthenium red (50 UM) 
included in the pipette solution in the whole-cell configuration showed 
no evidence of block, as large mechanically activated currents still 
displayed a linear I-V relationship (Supplementary Fig. 2). These 
results suggest that ruthenium red blocks the pore of MmPiezol- 
induced mechanically activated channels from the extracellular side. 
Notably, DmPiezo-induced mechanically activated currents were 
insensitive to ruthenium red concentrations that potently blocked 
MmPiezol-induced currents (Fig. 2d, e). Together, these results demon- 
strate that overexpression of DmPiezo or MmPiezol gives rise to 
mechanically activated channels with distinct channel properties. 
Next, we set out to determine the single channel conductance (1) of 
mechanically activated channels induced by Piezo proteins by using 
negative-pressure stimulations of membrane patches in cell-attached 
mode. Figure 3 shows the single mechanically activated channel 
properties of MmPiezol or DmPiezo. Openings of stretch-activated 
channels showed a marked difference in amplitude of single channel 
currents (Fig. 3a), as determined from the single channel J-V relation- 
ship for MmPiezol and DmPiezo (Fig. 3b, c). Linear regression of these 
I-V relationships resulted in slope-conductance values in these record- 
ing conditions of 29.9+1.9 and 3.3+0.3pS for MmPiezol- and 
DmPiezo-induced mechanically activated currents, respectively (n = 7 
and 5 cells; mean + s.e.m.). Therefore, DmPiezo-dependent channels are 
ninefold less conductive than MmPiezol-dependent channels. 


MmPiezol oligomerization 

The pore of most ion channels is formed by the assembly of trans- 
membrane domains from distinct subunits (for example, voltage-gated 
K* channels, ligand-gated ion channels) or structurally repetitive 
domains within a large protein (for example, voltage gated Na’ and 
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Figure 2 | Ruthenium red is a channel pore blocker of MmPiezo1- but not 
DmPiezo-induced currents. a, Representative traces of mechanically activated 
currents in MmPiezo1-transfected cells evoked at holding potentials ranging 
from —80 to +80 mV before (left panel) and during perfusion of 30 UM of 
ruthenium red (right panel, red traces). b, Average I-V relationship of 
mechanically activated currents in MmPiezol-transfected cells (n = 7 cells) 
before (black symbols) and during (red symbols) perfusion of 30 1M 
ruthenium red. Currents were normalized to the value of control current 
evoked at —80 mV for each individual cell. c, Concentration-inhibition curve 
for ruthenium red (RR) on mechanically activated currents evoked at -80 mV 
in MmPiezol-transfected cells and fitted with a Boltzmann equation. Each data 
point is the mean + s.e.m. of 3-13 observations. d, Representative traces of 
Piezo-dependent mechanically activated currents evoked at —80 mV in the 
presence of ruthenium red. e, Blocking effect of ruthenium red on Piezo- 
dependent mechanically activated currents evoked at —80 mV. Bars represent 
mean + s.e.m. and the number of cells tested is shown above the bars. 

**P < 0.01; ***P < 0.001; unpaired f-test. 


Ca** channels). As Piezo proteins lack repetitive transmembrane 
motifs presumably they oligomerize to form ion channels. To test this 
hypothesis, we determined the number of subunits in Piezo complexes 
by expressing GFP-MmPiezol fusion proteins in Xenopus laevis 
oocytes, imaging individual spots with total internal reflection micro- 
scopy (TIRF), and counting discrete photobleaching steps (Fig. 4a, b 
and ref. 22). Amino-terminal GFP~MmPiezo1 functionality was con- 
firmed by overexpression in HEK293T cells (Supplementary Fig. 3). 
We used several GFP fusion constructs of ion channels with known 
stoichiometry as controls: voltage-gated Ca?* channel («%1E-GFP; 
monomer), NMDA (N-methyl-D-aspartate) receptor (NR1 co-expressed 
with NR3A-GFP; dimer of dimers) and cyclic nucleotide gated (CNG) 
channel (XfA4-GFP; tetramer)”. We found that complexes of 
MmPiezo1 frequently exhibited at most four photobleaching steps, con- 
sistent with the idea that Piezo proteins homo-multimerize. Fluorescent 
MmPiezol (or CNG) complexes exhibiting bleaching in fewer than four 
steps can be explained by non-functional GFP or pre-bleached GFP” or 
general bias against noisier multi-step traces during data analysis (see 
Methods). Histograms of the number of photobleaching steps observed 
for MmPiezol complexes were comparable to histograms obtained 
from tetrameric CNG channels (Fig. 4c). These results suggest that in 
living cells, Piezo proteins can assemble as homo-multimers. 

We further characterized Piezo proteins biochemically by heterolo- 
gously expressing and purifying MmPiezol carboxy-terminally fused 
with a glutathione S-transferase (MmPiezol-GST). Functionality of 
MmPiezol-GST was confirmed by overexpression in HEK293T cells 
(Supplementary Fig. 3). We observed a protein band at a position near 
the 260-kDa protein marker on a Coomassie-blue-stained denatur- 
ing protein gel (Supplementary Fig. 4a). Western blot with a GST 
(Schistosoma japonicum form) antibody (Supplementary Fig. 4b) or 
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Figure 3 | MmPiezol- and DmPiezo-induced stretch-activated channels 
have different conductances. a, Representative Piezo-dependent stretch- 
activated channel openings elicited at —180 mV. Bottom traces represent 
average of 40 individual recording traces. b, All-point histograms of single 
channel opening events (average of 10 and 20 individual events for MmPiezol 
and DmPiezo, respectively) at different holding potentials (V},). c, Average I-V 
relationships of stretch activated single channels in MmPiezol and DmPiezo 
transfected cells (n = 7 and 5 cells, respectively; mean + s.e.m.). Single channel 
amplitude was determined as the amplitude difference in Gaussian fits as 
shown in b. 


a MmPiezol-specific antibody* (Fig. 4) confirmed the presence of 
MmPiezol-GST in the MmPiezol-GST sample. Using native gel 
electrophoresis and Coomassie blue staining, we detected a prominent 
protein band at a position near the 1,236 kDa protein marker only in 
the MmPiezol-GST sample (Fig. 4d). Western blot using MmPiezol 
antibody confirmed that this major band contains MmPiezol 
(Fig. 4e). These data indicate that the purified MmPiezol-GST protein 
complex has a molecular weight of about 1.2 million Da, four times the 
predicted molecular weight of a single MmPiezol-GST polypeptide 
(318 kDa). Next, we asked whether any endogenous proteins are 
present in this MmPiezol-containing complex. Mass spectrometry 
of the ~1.2 million Da protein complex mainly detected peptides 
derived from MmPiezol-GST, but not from other endogenous 
membrane proteins. Although several non-transmembrane proteins 
were also detected, most of them were also present in the control 
sample, indicating an absence of specific interacting proteins in the 
complex (Supplementary Table 1). Moreover, mass spectrometry of 
the whole purified solution samples before gel electrophoresis con- 
firmed that no other ion channel protein was detected (Supplementary 
Table 2). This indicates that MmPiezol is not tightly associated with 
any endogenous pore-forming protein. 

To examine further whether this Piezo complex is indeed a tetramer, 
we treated the purified MmPiezol-GST protein with the crosslinker 
formaldehyde and subjected the samples to denaturing gel electrophoresis 
and western blotting. Formaldehyde-treated samples contained three 
major additional higher-order Piezo-containing bands, with longer 
treatments increasing the prominence of the higher bands (Fig. 4f). 
The distribution of the bands on the 3-8% gradient gel suggests that 
the four bands correspond to monomer, dimer, trimer and tetramer 
of MmPiezol-GST (Fig. 4f). The observation that MmPiezol is 
crosslinked by formaldehyde, a crosslinker with a relative short spacer 
arm (2.3-2.7 A), suggests that the subunits form a tetramer. 
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It is possible that MmPiezol oligomers associate with other proteins; 
however, such an association might not withstand the GST purification 
step. To probe this, we performed paraformaldehyde (PFA) crosslink- 
ing experiments on living cells before the purification procedure. On a 
native gel, the MmPiezol-GST complex purified from PFA-treated 
cells also migrated to the position near the 1,236 kDa protein marker, 
similar to the sample from untreated cells (Fig. 4g). On a denaturing 
gel, on-cell PFA treatment resulted in four distinct MmPiezo1 -specific 
bands, similar to results of formaldehyde treatment on the purified 
complex (Fig. 4h). This suggests that MmPiezol is not tightly asso- 
ciated with other proteins large enough to alter discernibly its size on 
denaturing gels, and confirms the results from mass spectrometry 
analysis. However, our crosslinking studies with PFA might miss weak 
interactors with MmPiezol. Regardless, together with the results 
obtained from single-molecule photobleaching analysis in living cells, 
our biochemical data suggest that MmPiezol forms a homomultimeric 
ion channel, most likely as a homotetramer. 


MmPiezol reconstitution in lipid bilayers 


Finally, to assess whether Piezo proteins were sufficient to recapitulate 
the channel properties recorded from Piezo-overexpressing cells, we 
reconstituted purified MmPiezol proteins into lipid bilayers in two 
distinct configurations: droplet interface lipid bilayers (DIBs) assembled 
from two monolayers’** (Fig. 5a-e and 1-q) and proteoliposomes”® 
(Fig. 5f-h). In the first configuration, MmPiezol was reconstituted 
into asymmetric bilayers that mimic the cellular environment: the 
extracellular-facing lipid monolayer is predominantly neutral whereas 
the intracellular-facing leaflet is negatively charged’’. In contrast, the 
lipid composition of the bilayer in the second configuration is uniform. 

In the DIBs setting, representative segments from a 6-min record- 
ing obtained at —100mV show brief, discrete channel openings 
(Fig. 5a, b) blocked by addition of 50 uM ruthenium red to the neutral 
facing compartment (Fig. 5c). In contrast, no effect was observed 
when ruthenium red was introduced into the negative-facing com- 
partment (not shown). We detected efficient block of channel activity 
even at 5 uM ruthenium red (not shown). The asymmetric accessibility 
of ruthenium red block of reconstituted channels agrees with the data 
obtained from MmPiezol-overexpressing HEK293T cells (Fig. 2 and 
Supplementary Fig. 2), thereby establishing the fidelity of the assays 
and validating MmPiezol protein as an authentic ion channel. The 
Piezo currents exhibit ohmic behaviour; records displayed at higher 
resolution (Fig. 5b) clearly demonstrate the occurrence of unitary 
events with y values obtained from conductance histograms of 
118 + 15 pS and 80 + 6 pS (n = 6) in symmetric 0.5 M KCl from the 
negative and positive branches of I-V plots, respectively (Fig. 5d, e). 

A similar pattern of activity was obtained from MmPiezol recon- 
stituted in asolectin liposomes” (Fig. 5f-k). A selection of recordings 
shows the presence of two channels in the membrane which reside 
predominantly in the open state (Fig. 5f, g), as discerned in a higher 
time resolution display (Fig. 5k). These recordings were obtained in 
the presence of 50 LM ruthenium red inside the recording pipette, to 
ensure functional selection ofa single population of MmPiezo1 channels 
facing the ruthenium-red-free compartment. MmPiezol in asolectin 
proteoliposomes under these conditions (symmetric 0.2 M KCl) exhibits 
a y=110+10pS at V=—100mV and 80+5 pS at V=100mV 
(Fig. 5h-j) (n = 8). Finally, reconstitution of control samples purified 
from non-transfected cells as well as heat-denatured purified 
MmPiezol-GST into either bilayer systems under otherwise identical 
conditions failed to reproduce this pattern of channel activity (not 
shown). 

We then tested the ability of the reconstituted MmPiezol to conduct 
sodium (Fig. 51-q). Initially, single channel currents were recorded 
from asymmetric bilayers in symmetric 0.2M KCl; y=58+5pS 
(Fig. 51, 0). Subsequent addition of 0.2M NaCl in the presence of 
0.2 M KClincreased the unitary conductance of reconstituted channels 
to 95 + 5 pS (Fig. 5m, p) while retaining sensitivity to ruthenium red 
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Figure 4| MmPiezol forms homo-oligomers. a, Representative image of an 
acquired sequence showing three selected GFP—MmPiezo1 spots in the cell 
membrane. Levels were adjusted for clarity. Scale bar, 0.8 tum. b, Representative 
traces of fluorescence intensities of indicated single GFP-fusion constructs. 
Black arrows indicate photobleaching steps. c, Histograms of the average 
number of bleaching steps observed in ten or more movies from four or more 
oocytes of single fluorescent complexes of indicated constructs. d, e, Indicated 
samples purified and separated on native gels and visualized by Coomassie 
staining (d) or western blotting (e). Asterisk in d indicates a protein band 


block (Fig. 5n, q). These results confirm that these channels conduct 
both sodium and potassium as would be expected from a cationic non- 
selective channel. This assertion was further substantiated by recording 
MmPiezol currents from proteoliposomes under bi-ionic conditions 
(0.2 M KCI/0.2 M NaCl) (Supplementary Fig. 5a—h). A summary of the 
I—V relation for the MmPiezol channel, extracted from 204,088 
events obtained in three experiments, shows that the single channel 
current is ohmic between — 100 and 200 mV with a slope conductance 
of 102 + 2 pS (Supplementary Fig. 5i). The current reversed direction 
at 0.0+0.3mV, demonstrating that the channel does not select 
between K* and Na‘, and importantly, displays open channel block 
by ruthenium red (Supplementary Fig. 5j-1). 

The difference in y between overexpressed MmPiezo1 in cells and 
reconstituted MmPiezol in lipid bilayers may be attributed to many 
variables, including the distinct lipid environments which are known 
to influence conductance measurements strongly’ **. Moreover, the 
ionic conditions used in the two systems are different, as divalent 
cations present in HEK293T cell-attached experiments also affect 
the conductance values. Indeed, when divalent cations are excluded 
from the recording pipette, y of MmPiezol-induced currents in 
HEK293T cells is 58.0pS+1.5pS (150mM NaCl solution, 
Supplementary Fig. 6), compared to 29.9 + 1.5 pS in the presence of 
divalent ions (Fig. 3). The near equivalence of y values together with 
the similar pattern of channel activity demonstrates that reconstitution 
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specifically present in the MmPiezol sample. f, Purified MmPiezol-GST 
proteins treated with or without formaldehyde (FA) with the indicated time 
period, separated on a denaturing gel and detected with the anti-MmPiezol 
antibody. Sample purified from cells without transfection served as a negative 
control. g, h, MmPiezol-GST-transfected HEK293T cells or untransfected 
cells treated with or without 0.25% PFA for 10 min. The crosslinked 
MmPiezo1-GST proteins were purified and separated on native gel (g) or 
denaturing gels (h), followed by western blotting. Panels d—h are 
representatives of at least three independent experiments. 


of MmPiezol into two distinct bilayer systems produces channels with 
identical functional properties (Supplementary Table 3). 

Future reconstitution and recording of DmPiezo in lipid bilayers 
will show whether the difference in conductance between MmPiezol 
and DmPiezo arises from intrinsic properties. The membrane environ- 
ment and lipid composition are known to modulate the activity of the 
embedded channel proteins in a drastic and deterministic manner (for 
example, see refs 28-32). It is not entirely surprising that the conditions 
to emulate the cellular environment in the reconstituted system in 
terms of the mechanical state of the membrane or its lipid composition 
have thus far been inadequate to retrieve the activation features of 
mechanically activated ion channels. Furthermore, the complexity of 
protein clusters and dynamic cytoskeletal interacting partners at the 
cell membrane”? introduce regulatory constraints on channel activity. 
Further investigation may clarify whether Piezo ion channel subunits 
are intrinsically mechanosensitive or use unknown interacting 
partners to sense membrane tension. 


Concluding remarks 

We provide compelling evidence to support the hypothesis that Piezo 
proteins are indeed ion channels. First, overexpression of DmPiezo or 
MmPiezol in a human cell line gives rise to mechanically activated 
channels with distinct biophysical and pore-related properties. 
Second, isolated Piezo complexes do not contain detectable amounts 
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Figure 5 | MmPiezol forms ruthenium-red-sensitive ion channels. 

a-e, Reconstitution of purified MmPiezol into asymmetric lipid bilayers. 

a, Representative single channel currents at —100 mV. The section of the 
recordings indicated by the red asterisk is shown in b at a 10-fold higher time 
resolution. c, After 35 min of recording the channel activity shown in 

a, injection of 50 uM ruthenium red onto the neutral facing compartment 
blocks MmPiezol currents. d, All-event current amplitude histogram of a 
6-min recording; y = 124 + 7 pS. The total number of opening events (N) 
analysed was 18,424. e, Single channel I-V relationship, n = 6 experiments. 
f-k, Reconstitution of purified MmPiezo1 into asolectin proteoliposomes. 
Representative channel currents recorded at —100 mV (f) and + 100 mV (g) in 
the presence of 50 1M ruthenium red inside the recording pipette. Two open 
channels are present in the membrane. The segment of the 15 min recording 
shown in g indicated by the red asterisk is displayed in k at a 25-fold higher time 
resolution. h, i, All-event current amplitude histograms from 30s (h) and 

15 min (i) recordings: y = 110 + 10 pS (h) and 80 + 5 pS (i); N was 9,938 
events. j, Single channel I-V relationship, n = 8 experiments. 

1-q, Representative single channel currents at —100 mV of purified MmPiezol 
reconstituted into asymmetric lipid bilayers in symmetric 0.2 M KC] (J), after 
addition of 0.2 M NaCl (m) and after addition of 50 uM ruthenium red 

(n). Segments indicated by red asterisks in ln are displayed in panels 

o-q, respectively. C and O denote the closed and open states. 


of other channel-like proteins. Finally, purified MmPiezol protein 
reconstituted into proteoliposomes and planar lipid bilayers in the 
absence of any other cellular components gives rise to ruthenium-red- 
sensitive cationic ion channel activity. The MmPiezol complex is 
estimated to weigh ~1.2-million Da with 120-160 transmembrane 
segments, being, to our knowledge, the largest plasma membrane ion 
channel complex identified so far. 


METHODS SUMMARY 

Electrophysiology. Mechanical stimulation was achieved as previously described’. 
Subunit counting. The preparations were imaged on an inverted Nikon Ti-E 
fluorescence TIRF microscope (Nikon Corporation) and imaged with a high 
numerical aperture objective (Nikon 100 PlanApo, NA1.49). eGFP-fusion proteins 
were excited with a 488-nm Coherent laser (Coherent, Inc.) and images were 
collected with an Andor iXon DU-897 EMCCD camera. 

MmPiezo1-GST purification. Cells were collected and lysed 24h after transfec- 
tion, followed by an affinity purification. Initially, purification was conducted 
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from whole-cell lysates. Thereafter, purification was performed using the 
membrane fraction as starting material, resulting in significantly enhanced 
frequency of retrieval of channel activity after reconstitution. Untransfected cells 
were subjected to the same purification procedure to serve as a negative control. 
Purified samples were kept at 4°C until further analysis. 

Native gel electrophoresis. The purified MmPiezol-GST proteins or negative 
control samples were subjected to 3-12% NativePAGE Novex Bis-Tris gel for native 
(non-denaturing) electrophoresis according to the user manual (Invitrogen). After 
electrophoresis, the native gel was then either visualized by a fast Coomassie G-250 
staining or transferred to a PVDF membrane for western blotting. 
Reconstitution in lipid bilayers and proteoliposomes. Purified MmPiezol was 
reconstituted into proteoliposomes by detergent dilution. Excised patches from 
giant asolectin proteoliposomes were used for channel recordings. Asymmetric 
lipid bilayers were formed using the droplet interface strategy; one monolayer was 
composed of 1,2-diphytanoyl-sn-glycero-3 phosphocholine (DPhPC), and the 
other of 90% DPhPC and 10% of the negatively charged lipid, 1,2-dioleoyl-sn- 
glycero-3-phosphatidic acid (DOPA) (mole/mole) (Avanti Polar Lipids). 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 


Cloning of Drosophila piezo full-length cDNA. The Drosophila piezo gene 
(GenBank accession number JQ425255) was cloned from adult Drosophila 
poly(A) * RNAs (Clonetech) by RT-PCR. Primers for RT-PCR were designed 
based on the annotated sequence of CG8486. Two fragments of 2 kb and 6.5 kb 
were amplified and cloned sequentially into pIRES2-EGFP expression vector. 
Each cloning step was sequence verified. Full-length Drosophila piezo gene is 
8,355bp in length. The protein sequence of DmPiezo is shown in 
Supplementary Fig. 1. 

Cell culture and transient transfection. Human embryonic kidney 293T 
(HEK293T), NIH-3T3, F11 and HeLa cells were grown in Dulbecco’s Modified 
Eagle Medium containing 4.5mgml~* glucose, 10% fetal bovine serum, 
50U ml’ penicillin and 50p1gml~’ streptomycin. Cells were plated onto 
poly-lysine-coated 12-mm round glass coverslips placed in 24-well plates and 
transfected using lipofectamine 2000 (Invitrogen) according to the manufac- 
turer’s instruction. 500-1,000 ng ml~ lof plasmid DNA was transfected and cells 
were recorded 12-48 h later. 

Electrophysiology. Patch-clamp experiments were performed in standard 
whole-cell or cell-attached recordings using an Axopatch 200B amplifier (Axon 
Instruments). Patch pipettes had resistance of 2-3 MQ when filled with an 
internal solution consisting of (in mM) 133 CsCl, 10 HEPES, 5 EGTA, 1 CaCh, 
1 MgCl, 4 MgATP and 0.4 Na,GTP (pH adjusted to 7.3 with CsOH). The extra- 
cellular solution consisted of (in mM) 130 NaCl, 3 KCl, 1MgCl:, 10 HEPES, 
2.5 CaCl, 10 glucose (pH adjusted to 7.3 with NaOH). For cell-attached record- 
ings, pipettes were filled with a solution consisting of (in mM) 130 NaCl, 5 KCl, 
10 HEPES, 1CaClh, 1MgCl,, 10TEA-Cl (pH7.3 with NaOH), except for 
Supplementary Fig. 6 where the internal solution was (in mM) 150 NaCl, 
10 HEPES (pH adjusted to 7.3 with NaOH). External solution used to zero the 
membrane potential consisted of (in mM) 140KCIl, 10 HEPES, 1MgCh, 
10 glucose (pH 7.3 with KOH). All experiments were done at room temperature. 
Currents were sampled at 50 or 20 kHz and filtered at 5 or 2 kHz. Voltages were 
not corrected for a liquid junction potential. Leak currents before mechanical 
stimulations were subtracted off-line from the current traces. 10 mM ruthenium 
red stock solution was prepared in water. 

Mechanical stimulation. For whole-cell recordings mechanical stimulation was 
achieved using a fire-polished glass pipette (tip diameter 3-4 1m) positioned at an 
angle of 80° to the cell being recorded. Downward movement of the probe 
towards the cell was driven by a Clampex controlled piezo-electric crystal micro- 
stage (E625 LVPZT Controller/Amplifier; Physik Instrumente). The probe was 
typically positioned ~2 jim from the cell body. The probe had a velocity of 1 jm 
ms | during the ramp segment of the command for forward motion and the 
stimulus was applied for 150 ms. To assess the mechanical sensitivity of a cell, a 
series of mechanical steps in 1 um increments was applied every 10-20 s, which 
allowed full recovery of mechanosensitive currents. Inward mechanically activated 
currents were recorded at a holding potential of —80 mV. For J-V relationship 
recordings, voltage steps were applied 0.7 s before the mechanical stimulation from 
a holding potential of —60 mV. 

For cell-attached recordings, membrane patches were stimulated with brief 
negative pressure pulses through the recording electrode using a Clampex 
controlled pressure clamp HSPC-1 device (ALA-scientific). Unless otherwise 
stated, stretch-activated channels were recorded at a holding potential of 
—80mV with pressure steps from 0 to -60 mmHg (— 10 mm Hg increments). 
Current-pressure relationships were fitted with a Boltzmann equation of the 
form: I(P) = [1 + exp(—(P - Ps9)/s)] ~1 where I is the peak of stretch-activated 
current at a given pressure, P is the applied patch pressure (in mm Hg), Ps» is the 
pressure value that evoked a current value which is 50% of Imax, and s reflects the 
current sensitivity to pressure. 

Single-channel amplitude characterization was performed on patches that 
showed strong stretch-activated current activity at -80 mV using increasing steps 
of negative pressure up to —60 mm Hg. Similar activity was never present in 
control-transfected cells. Negative pressure steps were then reduced to low to 
moderate level (—5 to —20 mm Hg) allowing detection of single channel openings. 
Subunit counting. For oocyte injection, all construct plasmids were linearized at 
C terminus with Nhel, HindIII or NotI and DNA transcribed with T7 mMessage 
mMachine Kit (Ambion) and poly(A)-tailing Kit (Ambion) and cleaned with 
LiCl precipitation. 50 nl of 0.2 ug pl’ mRNA was injected into Xenopus oocytes 
(Nasco). 

For acquisition, 12-24h after injection, oocytes were osmotically shocked in 
stripping buffer (in mM: 220 N-methyl glucamine aspartate, 10 HEPES, 1 MgCl) 
and mechanically de-vitellinated. MatTek dishes (MatTek Corporation) were 
prepared by sonication in 1 M KOH to remove background fluorescence and 
further sonicated in MilliQ dH2O. Oocytes were placed onto MatTek dishes into 
SOS buffer (in mM: 100 NaCl, 2 KCl, 1.8 CaCl,-H,0, 1 MgCl,-6H,O, 5 HEPES, 
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2.5 Na pyruvate and 50p1gml~! gentamicin, pH7.0). The preparations were 
imaged on an inverted Nikon Ti-E fluorescence TIRF microscope (Nikon 
Corporation) and imaged with a high numerical aperture objective (Nikon 
100X PlanApo, NA1.49) with an additional X1.5 Optovar magnification. eGFP 
fusion proteins were excited with a 488-nm Coherent laser (Coherent, Inc.) and 
images were collected with an Andor iXxon DU-897 EMCCD camera. Sixty-second 
movies were collected at 100-ms exposures, for a frame rate of 10 Hz. 

Using Nikon Elements software, movies were duplicated and processed with a 
rolling average of 2. A second duplicate was filtered with a low-pass kernel of 7, to 
remove background. The low-pass images were subtracted from the averaged 
images, to produce the movies used for analysis. Non-overlapping 4 X 4 pixel 
regions of interest were drawn around randomly selected spots that were clearly 
separated from neighbouring bright pixels. The spots were required to fit entirely 
within the 4 X 4 pixel regions. Pixel size was 0.11 um. The average intensity of 
each region was plotted over the length of the movies. Traces were discarded if the 
intensity increased after an initial decrease, if the fluorescent spot moved out of 
the region, or if the fluorescent signal showed a continuous decay instead of step- 
wise bleaching. Finally, the number of bleaching steps was counted for each spot. 
MmPiezo1-GST purification. The MmPiezol-GST construct was subcloned by 
inserting a GST encoding sequence from Schistosoma japonicum into the 
MmPiezol construct* at the 3’ end of MmPiezol cDNA sequence using the AscI 
and SaclI restriction enzyme sites. The resulting MmPiezol-GST fusion protein 
has 2,773 amino acids. 

After incubation with cell lysates overnight at 4 °C, the glutathione beads were 
washed four times in a buffer containing 25 mM NaPIPES, 140 mM NaCl, 0.6% 
CHAPS, 0.14% phosphatidylcholine (PC), 2.5mM dithiothreitol (DTT), and a 
cocktail of protease inhibitors and eluted with 100 mM glutathione in a buffer 
containing 25 mM NaPIPES, 50 mM Tris, 0.6% CHAPS, 0.14% PC, 2.5 mM DTT 
and a cocktail of protease inhibitors. The eluant was dialysed against a buffer 
containing 25 mM NaPIPES, 0.6% CHAPS, 0.14% PC, 2.5 mM DTT and a cocktail 
of protease inhibitors. The purified samples were kept at 4°C. Samples purified 
according to this protocol were used for all the biochemical work and the initial 
reconstitution experiments. However, because retrieval of channel activity from 
the reconstituted MmPiezo1-GST fluctuated from preparation to preparation, we 
adopted an alternative purification protocol involving the membrane fraction as 
the starting material. Specifically, 24h after transfection, cells were collected and 
homogenized in a buffer containing 25 mM NaPIPES, 50 mM NaCl, 2.5 mM DTT, 
anda cocktail of protease inhibitors. The cell suspension was forced to go through a 
25.5G needle for 20 times and centrifuged at 1,000g for 15min at 4°C. The 
supernatant was collected and centrifuged at 167,000g for 30 min at 4°C. The 
resulting membrane fraction was washed three times (using a buffer containing 
25mM NaPIPES, 150 mM NaCl, 2.5mM DTT, and a cocktail of protease inhibi- 
tors) and used as the starting material for MmPiezo1-GST purification using the 
same procedure described above. Purification from the membrane fraction greatly 
reduced the content of endogenous GST proteins and significantly enhanced the 
frequency of retrieval of MmPiezol channel activity after reconstitution (Fig. 5, 
Supplementary Fig. 5 and Supplementary Table 3). 

NativePAGE Novex Bis-Tris gel. The purified MmPiezol-GST proteins and 
control samples were subjected to 3-12% NativePAGE Novex Bis-Tris gel for 
native (non-denaturing) electrophoresis according to the User Manual 
(Invitrogen). In brief, samples were mixed with NativePAGE Sample Buffer 
and NativePAGE 5% G-250 Sample Additive and then subjected to electrophoresis 
at 150V for 2h. The use of G-250 charge-shift in NativePAGE gels results in 
protein resolution based upon protein size and therefore allows accurate size 
estimation of native protein complexes**. However, the native protein conforma- 
tion may give an expected size estimation error of ~15%. After electrophoresis, the 
native gel was then either visualized by a fast Coomassie G-250 staining or trans- 
ferred to a PVDF membrane for western blotting with an antibody specifically 
against Piezol proteins. 

Formaldehyde and paraformaldehyde crosslinking. The purified MmPiezol- 
GST proteins were treated with or without 0.1% formaldehyde at room temper- 
ature for different periods of time and then mixed with NuPAGE LDS Sample 
Buffer and NuPAGE Reducing Agent (Invitrogen), followed by heating at 70 °C 
for 10 min to denature the protein. The treated samples were subjected to 3-8% 
NuPAGE Tris-Acetate gel electrophoresis under denaturing conditions. For live 
cell crosslinking, 0.25% concentration of PFA was added to the cell culture 
medium and kept at room temperature for 10min, followed by adding 
125mM glycine to stop the PFA crosslinking reaction. Treated cells were 
collected and subjected to sequential steps of protein purification, 3-8% 
NuPAGE Tris-Acetate gel electrophoresis under denaturing conditions or 
3-12% NativeP AGE Novex Bis-Tris gel for native (non-denaturing) electrophoresis, 
and western blotting with the anti-Piezol antibody. 
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Western blotting. After electrophoresis, either the native or denaturing PAGE 
gels were transferred to PVDF membranes. Transferring protein from native gel 
to PVDF membranes was conducted according to instructions for NativeP AGE 
Novex Bis-Tris gel system. Transferred PVDF membranes were blocked with 5% 
milk in TBS buffer with 0.1% Tween-20 (TBST buffer) at room temperature for 
1h, and then incubated with the anti-Piezol antibody (1:200) at 4 °C overnight. 
The membranes were washed with TBST buffer and incubated with peroxidase- 
conjugated anti-rabbit IgG secondary antibody (1:10,000) at room temperature 
for 1h. Proteins were detected with the ECL plus detection kit (GE Healthcare). 
Mass spectrometry. Purified samples were separated on the 3-12% NativePAGE 
Novex Bis-Tris gel and visualized by fast Coomassie G-250 staining. The gel 
band containing the MmPiezol-GST complex or the corresponding blank band 
from the control sample near the 1,236 kDa molecular marker was excised and 
subjected to the Scripps Center for Mass Spectrometry for analysis. In brief, the 
gel bands were destained, reduced with 10mM DTT, alkylated with 55mM 
iodoacetamide, and digested with Trypsin overnight before analysis using the 
nano-LC-MS/MS. The nano-LC-MS/MS data obtained on a LTQ ion trap mass 
spectrometer was searched using the MmPiezol-GST protein sequence and 
NCBInr Homo sapiens database. In separate sets of experiments, the purified 
MmPiezo1-GST and control solution samples before gel electrophoresis were 
subjected to mass spectrometry (Supplementary Table 2). 

Reconstitution into proteoliposomes or DIBs, single channel recordings and 
analysis. Purified MmPiezol-GST protein was reconstituted into asolectin 
(soybean polar lipid extract, Avanti) liposomes (10 mg ml ') by incubating the 
mixture (lipid/protein mass ratios between 2,000:1 and 1,000:1; this corresponds 
toa molar lipid/protein ratio of ~800,000-400,000:1) on ice for 5 min followed by 
X20 dilution in 200 mM KCI, 5 mM MOPS pH 7.0 and incubated with rotation at 
room temperature for 20 min. Biobeads were added to mixture and incubated 
with rotation for 1h. Thereafter, biobeads were removed by filtration and a new 
batch of beads was added. After 30 min incubation, the biobeads were filtered and 
the sample was centrifuged at 60,000 r.p.m. for 60 min at 8 °C. The proteoliposome 
pellet was re-suspended in 40 ul of the same buffer and used to place two 25 ul 
drops on a cover slide. The samples were dried under vacuum for >16h at 4°C. 
Samples were hydrated with 25 ul of the same buffer and allowed to sit for 2h 
before starting recordings. Thereafter, 2-3 il of proteoliposomes were withdrawn 
from the edge of the spots on the cover slide and transferred to the recording 
chamber. After 5 min, the chamber was slowly filled with recording solution. 
Multi-GQ seals were made to proteoliposomes immobilized at the bottom of the 


recording chamber. At that time, the proteoliposome patch was excised and 
brought through the air-water interface. Excised patches were used**. Pipette 
and bath solution contained (in mM) 200 KCl, 5 MOPS titrated to pH7.0 with 
KOH. Capillaries of borosilicate glass from Sigma were pulled to yield resistances 
of 1-2 MQ when immersed in recording solution. 

Droplet interface lipid bilayers (DIBs) were formed between two lipid monolayer- 
encased aqueous nanolitre droplets submerged in hexadecane”’. Liposomes were 
composed of 1,2-diphytanoyl-sn-glycero-3 phosphocholine (DPhPC) or 90% 
(mole/mole) DPhPC and 10% of the negatively charged lipid, 1,2-dioleoyl-sn- 
glycero-3-phosphatidic acid (DOPA) (Avanti Polar Lipids). MmPiezol was 
diluted directly into the liposome suspension to yield a final concentration of 
2-5ngml ’. The electrode carrying the droplet with MmPiezol and desired 
buffer-lipid mix (in mM, 500 KCl, 10 HEPES, pH 7.4, 0.5 lipid solution of 
DPhPC) was connected to the grounded end of the amplifier head-stage 
(Axopatch 200B). The second electrode, in a droplet containing the same buffer 
and 10% DOPA:90% DPhPC, was connected to the working end of the head-stage. 
Where indicated, ruthenium red or 0.2 M NaCl was injected using a nano-injector 
(WPI, Inc.). 

For proteoliposome patches, records were acquired at a sampling frequency of 
40 kHz and filtered online to 5 kHz with a 3-pole Bessel filter before digitization; 
for DIBs, data acquisition was at 10 kHz and filtered at 2 kHz. For analysis and 
presentation, records were filtered to 1 kHz with a low-pass Gaussian filter. 
Transitions were detected by the half-threshold method implemented in 
Clampfit (proteoliposomes) and by the segmental k-means method (SKM) 
implemented in QuB (DIBs). Transitions <0.5 ms were excluded from the pool 
for analysis to correct for detection of false and missed events. Data were analysed 
using Clampfit v.9.2 software (Axon Instruments), QuB, Excel 2007 (Microsoft), 
and IGOR Pro (Wavemetrics). y was calculated from Gaussian fits to currents 
histograms. All statistical values represent mean + s.e.m., unless otherwise indi- 
cated. n and N denote number of experiments and number of events, respectively. 
All experiments were done at room temperature. 
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Pressure has an essential role in the production’ and control”* of 
superconductivity in iron-based superconductors. Substitution of a 
large cation by a smaller rare-earth ion to simulate the pressure 
effect has raised the superconducting transition temperature T, to 
a record high of 55 K in these materials**. In the same way as T, 
exhibits a bell-shaped curve of dependence on chemical doping, 
pressure-tuned T, typically drops monotonically after passing the 
optimal pressure’ *. Here we report that in the superconducting 
iron chalcogenides, a second superconducting phase suddenly re- 
emerges above 11.5 GPa, after the T. drops from the first maximum 
of 32 K at 1 GPa. The T, of the re-emerging superconducting phase 
is considerably higher than the first maximum, reaching 48.0- 
48.7 K for Tlo,6Rbo,4Fe1.67Se2, Ko,.gFe;,7Se2 and Ko.gFe,.7gSe. 

The recent discoveries of superconductivity at 30-32 K in a new 
family of iron-based chalcogenide superconductors*” A, — ,Fe, — Se, 
(where A = K, Rb or Cs, with possible Tl substitution) bring new excite- 
ment to the field of superconductivity’’. These superconductors have 
unusually large magnetic moments up to 3.3/g per Fe atom and a Fe- 
vacancy ordering in the Fe square lattice’. How superconductivity with 
such a high T. can exist on such a strong magnetic background remains 
perplexing"®. It has been established that superconductivity in strongly 
correlated electronic systems can be dictated by their crystallographic 
structure, electronic charge, and orbital and spin degrees of freedom, 
which can all be manipulated by controlling parameters such as pressure, 
magnetic field and chemical composition’*-’>. Pressure is a ‘clean’ way 
to tune basic electronic and structural properties without changing the 
chemistry. High-pressure studies are thus very useful in elucidating 
mechanisms of superconductivity as well as in searching for new 
high-T. superconducting materials. 

We studied single crystals of Tly.Rbo 4Fey,67Se2, Ko.gFe;.7Se, and 
Ko gFe;.7gSe2 grown by the Bridgman method*!*’”. We conducted both 
high-pressure resistance and susceptibility measurements to detect 
superconductivity in situ at high pressures and low temperatures. 
Figure 1 shows the temperature dependence of the electrical resistance 
at different pressures for Tlo 6Rbo.4Fe;.57Se. single crystals. Here we 
define T, as the intersection of the tangent through the inflection point 
of the resistive transition with a straight-line fit of the normal state just 
above the transition. As can be seen, T- starts at the maximum of 33 K 
at 1.6 GPa, shifts to lower temperatures at increasing pressures, and 
vanishes near 9 GPa in our experimental temperature range, which is 
300-4 K for our high-pressure resistance measurements (Fig. la). At 
slightly higher pressures, however, an unexpected superconducting 
phase re-emerges with an onset T, as high as 48.0K at 12.4GPa 
(Fig. 1b). The sample is not superconducting at pressures higher than 
13.2 GPa. We repeated the measurements with new samples in three 
independent experiments, and the results were reproducible. 


To confirm the pressure-induced changes of superconductivity in 
Tlo.6Rbo 4Fe.67Se2, we also performed magnetic alternating-current 
susceptibility measurements in situ at high pressures (Fig. 2). The value 
of T- is taken to be the onset of superconductivity defined by the 
intersection of a line drawn through the steep slope of the curve and 
the region of zero slope above the transition. The magnetic study 
showed that T, decreased with increasing pressure and vanished at 
9.8 GPa in the first superconducting phase SC-I (Fig. 2a). With further 
increasing pressure, the material enters a new superconducting phase 
SC-II and its transition temperature reaches 40.2K at 12.2GPa 
(Fig. 2b). The magnetic measurements yield T. values consistent with 
the resistivity data within the experimental uncertainties. These results 
provide convincing evidence for the existence of two distinct super- 
conducting phases in Tl 6Rbo,.4Fe) 67Se2. 
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Figure 1 | Temperature-dependence of electrical resistance for 
Tlo.6Rbo.4Fe1.67Se2 at different pressures. a, Resistance-temperature curves in 
the initial superconducting phase (SC-I) up to 9.4 GPa. T,, was observed to shift 
to lower temperature with increasing pressure. Superconductivity disappears at 
9.4 GPa. b, Electrical resistance curves for the same single crystal at higher 
pressures. A new superconducting state re-emerges upon further compression. 
The pressure-induced superconducting phase (SC-II) has a T, of 48 K, which is 
much higher than the maximum in SC-I. Cryogenic resistance measurements 
were performed in a diamond-anvil cell. Diamond anvils with 600-1m and 300- 
jum tip flats were used with sample chambers of diameter 300 um and 100 tum, 
respectively. Four electrical leads were attached to the single-crystal sample 
insulated from the rhenium gasket, and loaded into the sample chamber. NaCl 
powders were employed as a pressure medium. The ruby fluorescence method 
was used to gauge pressure”. 
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Figure 2 | Temperature dependence of the alternating- 

current susceptibility for Tlp,;Rbo.4Fe;.67Se, at different pressures. 

a, Superconducting transitions observed in the real susceptibility component of 
the sample at pressures of 2.5, 5.4 and 9.8 GPa in SC-I. The superconducting 
transition shifts downward to lower temperature with increasing pressure. At 
9.8 GPa the susceptibility component remains constant upon cooling down to 
4K, indicating that the sample is no longer superconducting. b, The real 
component of the susceptibility versus temperature for the crystal in SC-II at a 
pressure of 12.2 GPa. The inset shows the set-up for alternating-current 
susceptibility measurements in a diamond-anvil cell, with a signal coil around 
the diamond anvils and a compensating coil. The alternating-current 
susceptibilities were detected within a lock-in amplifier*'. The crystals were 
loaded into the sample chamber, which is a hole in the centre of the 
nonmagnetic gaskets, with Daphne 7373 as the pressure medium. 


To investigate whether the pressure-induced re-emergence of super- 
conductivity was unique to Tlo.6Rbp,4Fe;,67Se2 or more general among 
iron chalcogenides, we conducted parallel electrical resistance measure- 
ments on Ko Fe; 7Se2 single crystals, and observed nearly identical 
behaviour (Fig. 3). The initial T. of 32K at 0.8-1.6GPa decreased 
monotonically with increasing pressure and became undetectable at 
9.2 GPa. At a slightly increased pressure, the second superconducting 
phase of Ko gFe; 7Se, re-emerged and reached the maximum T- of 48.7 K 
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Figure 3 | Temperature dependence of the resistance for Ko,sFe,7Se, at 
different pressures. a, SC-I. The resistance-temperature curves showing the T, 
reduction with increasing pressure and its disappearance at 9.2 GPa. b, SC-II. 
The resistance measurements reveal another superconducting phase above 
10.5 GPa. The T, reaches 48.7 K at 12.5 GPa and disappears at 13.2 GPa. The 
black curve has been multiplied by 100. 
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at 12.5GPa. We repeated the experiment six times using six single 
crystals cut from different batches, and the results were reproducible. 
We further repeated the measurements with a slightly different com- 
position, Kp gFe;7gSe2, and again, observed similar pressure-induced 
behaviour. 

Wesummarized the pressure dependence of T. of Tlp,¢Rbo 4Fe; 67Se2, 
Ko.gFe) 7Se2, and Ko gFe; 7gSe in Fig. 4and Supplementary Tables 1 to 4. 
The diagram clearly reveals two distinct superconducting regions: the 
initial superconducting phase SC-I and the pressure-induced super- 
conducting phase SC-II. In the SC-I region, T- is suppressed with 
applied pressure and approaches zero between 9.2 and 9.8 GPa. At 
higher pressures, the SC-II region appears, in which the T, is even 
higher than the maximum T, of the SC-I region. The SC-II region 
has a maximum T. of 48.7K for KogFe,7Se, and 48.0K for 
Tlo.6Rbo 4Fe;.67Se2, higher than previously observed in chalcogenide 
superconductors. The SC-II region appears in a narrow pressure 
range. Unlike the usual parabolic pressure-tuning curve of T,, the high 
T, in SC-II appears abruptly above 9.8 GPa and disappears equally 
abruptly above 13.2GPa. Intermediate T.< 38K is not observed 
even with small pressure increment steps of 0.1GPa. A similar 
re-emergence of superconductivity has been observed in some other 
strongly correlated electronic systems, such as heavy-fermion’? and 
organic systems’. 

Our preliminary high-pressure polycrystalline X-ray diffraction 
results of the two iron chalcogenides Ko gFe;.7Se. and Ko Fe 7gSe2 
confirm that, to the first degree, the basic tetragonal crystal structure 
persists throughout the pressure range studied (Supplemen- 
tary Information). Therefore, the disappearance of T- in SC-I, the 
re-emergence of higher T in SC-IL, and the final non-superconducting 
region reflect detailed structural variances within the basic tetragonal 
unit cell, which await future in-depth investigation with advanced 
diagnostic probes. For instance, the possible change in magnetic- 
ordering structures would require high-pressure neutron diffraction, 
and the possible superlattice and Fe vacancy ordering would require 
high-pressure single-crystal X-ray structural investigations. 

The pressure dependence of T, in the SC-I region is expected but its 
mechanism is still much debated. Quantum criticalities are thought to 
affect superconductivity for strongly correlated electronic systems’’. A 
characteristic feature of the new iron chalcogenide superconductors is 
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Figure 4 | Pressure dependence of the T, for Tlo,¢Rbo.4Fey 67525 Ky gFe1.7Se2 
and Ko gFe;.7gSe2. The symbols represent the pressure-temperature 
conditions for which T- values were observed from the resistive and alternating- 
current susceptibility measurements; symbols with downward arrows represent 
the absence of superconductivity to the lowest temperature (4K). All 
Tlo6Rbo.4Fe1 67Se2, Ko.gFey.7Se2 and Ky gFe;,7gSe, samples show two 
superconducting regions (SC-I and SC-II) separated by a critical pressure at 
around 10 GPa. NSC, the non-superconducting region above 13.2 GPa. The 
maximum T, is found to be 48.7 K in Ko. gFe; 7Se. at a pressure of 12.5 GPa. At 
higher pressures above 13.2 GPa, the samples are non-superconducting. Error 
bars are one standard deviation. 
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the existence of Fe-vacancies in the Fe-square lattice, ordered by a 
(5x 5 superstructure’. It remains unclear whether pressure could 
destroy the vacancy ordering at a critical value and drive the materials 
into a disordered lattice. Detailed structural studies of these super- 
conducting behaviours in the iron chalcogenide superconductors are 
currently being conducted. Their magnetic properties at high pres- 
sures should help us to understand the interplay of magnetism and 
superconductivity in these iron chalcogenides. 

This observation of the SC-II region with the re-emerging higher Tis 
unexpected. It will certainly stimulate a great deal of future experimental 
and theoretical studies to clarify whether the observed re-emergence of 
superconductivity in iron chalcogenides is associated with the quantum 
critical transition, magnetism, superstructure, vacancy ordering or spin 
fluctuation. 
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The sirtuin SIRT6 regulates lifespan in male mice 


Yariv Kanfi', Shoshana Naiman'*, Gail Amir*, Victoria Peshti!, Guy Zinman’, Liat Nahum!, Ziv Bar-Joseph* & Haim Y. Cohen! 


The significant increase in human lifespan during the past century 
confronts us with great medical challenges. To meet these challenges, 
the mechanisms that determine healthy ageing must be understood 
and controlled. Sirtuins are highly conserved deacetylases that have 
been shown to regulate lifespan in yeast, nematodes and fruitflies'. 
However, the role of sirtuins in regulating worm and fly lifespan has 
recently become controversial’. Moreover, the role of the seven 
mammalian sirtuins, SIRT1 to SIRT7 (homologues of the yeast 
sirtuin Sir2), in regulating lifespan is unclear’. Here we show that 
male, but not female, transgenic mice overexpressing Sirt6 (ref. 4) 
have a significantly longer lifespan than wild-type mice. Gene 
expression analysis revealed significant differences between male 
Sirt6-transgenic mice and male wild-type mice: transgenic males 
displayed lower serum levels of insulin-like growth factor 1 
(IGF1), higher levels of IGF-binding protein 1 and altered 
phosphorylation levels of major components of IGF1 signalling, a 
key pathway in the regulation of lifespan*. This study shows the 
regulation of mammalian lifespan by a sirtuin family member 
and has important therapeutic implications for age-related diseases. 

Sirtuins are highly conserved NAD* -dependent deacetylases that 
have been shown to regulate lifespan in several organisms. Increasing 
the sirtuin level through genetic manipulation extends the lifespan of 
yeast, nematodes and flies’. Yet, despite many publications supporting 
a pro-longevity role for sirtuins, there has been recent debate about the 
direct role of Caenorhabditis elegans and Drosophila melanogaster SIR- 
2 in ageing and lifespan extension in response to calorie restriction 
(also known as dietary restriction)**’”. Some mammalian sirtuins have 
been shown to regulate age-related diseases, but mice that overexpress 
SIRT1 have the same lifespan as control, wild-type (WT), mice®. Thus, 
the role of SIRT1 and other mammalian sirtuins in regulating mam- 
malian lifespan is unclear’. 

Several key findings support a potential role for SIRT6 in regulating 
mammalian lifespan. SIRT6-deficient mice are small and have severe 
metabolic defects, and by 2-3 weeks of age, they develop abnormalities 
that are usually associated with ageing’. In addition, SIRT6 regulates 
nuclear factor-«B signalling, which controls ageing-associated changes 
in gene expression’®. Recently, we showed that SIRT6 levels increase in 
rats that are fed a calorie-restricted diet'!, and transgenic mice that 
overexpress exogenous mouse SIRT6 (Sirt6-transgenic mice; also 
known as MOSES mice)* are protected against the physiological 
damage caused by diet-induced obesity, including triglyceride and 
low-density-lipoprotein-associated cholesterol accumulation in the 
serum, increased body fat and reduced glucose tolerance. In normal 
animals, these metabolic defects become apparent by middle age, 
whereas their appearance is delayed in animals fed a calorie-restricted 
diet. Thus, in this study we sought to determine whether Sirt6- 
transgenic mice remain healthy for longer and have a longer lifespan 
than wild-type mice. 

The lifespan of Sirt6-transgenic mice was examined in comparison 
to their control littermates. Sirt6-transgenic mice were produced on a 
segregating stock containing equal contributions from C57BL/6J and 
BALB/cOlaHsd mouse strains, both of which are considered to be long 


lived’. The study was carried out on 245 mice (119 males and 126 
females) from two transgenic lines (line 55 and line 108) generated 
from two separate founders. Log-rank test analysis showed significant 
differences in the survival curves between male WT and male trans- 
genic mice, but not between female WT and female transgenic mice, 
for both lines (Fig. la-d and Supplementary Table 1). Relative to male 
WT littermates, the median lifespan of male Sirt6-transgenic mice 
increased by 14.5% and 9.9%, and the mean lifespan increased by 
14.8% and 16.9%, for line 55 and 108, respectively (log-rank test, 
7 =10.529, df=1 and P=0.001 for line 55; and 7° = 4.225, 
d.f. = 1 and P = 0.040 for line 108). In female Sirt6-transgenic mice, 
no significant increase in median or mean lifespan was found relative 
to female WT littermates for either line (log-rank test, ¢ = 0.009, 
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Figure 1 | Extended lifespan of male Sirt6-transgenic mice. Kaplan-Meier 
survival curves for male and female WT and Sirt6-transgenic (Sirt6-tg) mice 
from two transgenic lines, line 55 (a, b) and line 108 (c, d). P values were derived 
from log-rank calculations. Glucose tolerance testing was carried out in WT 
and Sirt6-transgenic males (e) and females (f) at 19 months (572-577 days) of 
age (males, n = 6 per genotype; females, n = 4 per genotype). The area under 
the curve (AUC) for each glucose tolerance test is shown on the right (e, f y axis 
values shown are the AUC divided by 1,000 ). The values shown are 

mean = s.e.m. *, P< 0.05 (two-tailed t-test). 
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df.=1 and P=0.924 for line 55; and va = 0.993, df.=1 and 
P=0.319 for line 108). Relative to WT littermates, the maximum 
lifespan of transgenic males (that is, the mean lifespan of the oldest 
10% of a cohort to die) increased by 15.8% and 13.1% for line 55 and 
108, respectively. Comparison of the maximum lifespan of WT and 
Sirt6-transgenic mice using the quantile regression approach at the 
ninetieth percentile’? showed a significant difference between males in 
one line only (P = 0.03 and P = 0.11 for line 55 and 108, respectively) 
and no difference for females (P = 0.45 and P = 0.67 for line 55 and 
108, respectively). Cox regression analysis (using the stepwise back- 
ward, Wald method) with the recruitment date, parental identity, 
gender, genotype and mouse line as main effects and line-by-genotype 
as the interaction variable showed an additive effect of genotype and 
line (Supplementary Table 2). However, there was no interaction 
between mouse line and genotype (P = 0.693), indicating that SIRT6 
overexpression had an equivalent effect on the mortality of both lines. 
In summary, our data show that SIRT6 overexpression increased the 
longevity of males but not females. 

SIRT6 has been shown to regulate genomic stability and metabolism*”, 
two important contributors to longevity. Loss of genomic stability is 
known to be an important aspect of cancer. Post-mortem gross and 
microscopic examination of the WT and transgenic mice revealed 
malignant tumours in a variety of organs, with the highest incidence 
of tumours in all mice being in the lungs. No significant differences in 
tumour spectrum or incidence were found between WT and transgenic 
mice (Supplementary Table 3). Similarly, pathological analysis 
revealed no differences between WT and transgenic males in the 
incidence of non-neoplastic findings (for example, diffuse mesangial 
sclerosis and pulmonary emphysema) or age-related pathologies (for 
example, femoral osteoporosis, basal ganglia calcification and adrenal 
cortical hyperplasia) (data not shown). Interestingly, the median life- 
span of Sirt6-transgenic mice with lung tumours showed a trend 
towards being longer (by 11.7%) than that of WT mice with lung 
tumours. Therefore, the hypothesis that the effect of SIRT6 on lung 
cancer has a role in SIRT6’s pro-longevity effect cannot be entirely 
excluded. However, given the proportion of mice with lung tumours 
in each genotype, a protective role of SIRT6 against lung cancer is likely 
to contribute only partially to the pro-longevity effect (Supplementary 
Information). Thus, further studies are required to evaluate the con- 
tribution of SIRT6 to age-sensitive traits, in addition to its effect on 
lung cancer. 

The protective role of SIRT6 against metabolic disorders that are 
induced by a high-fat diet* suggests that SIRT6 might positively affect 
age-associated metabolic disorders, such as declining insulin sensitivity 
and impaired glucose tolerance. No significant differences in glucose 
metabolism were found between young (4-7 month old) WT and Sirt6- 
transgenic mice (data not shown). However, an intraperitoneal glucose 
tolerance test showed that old Sirt6-transgenic mice (19 months old, the 
maximum age of WT mice before a considerable proportion of the litter 
died) displayed a trend towards improved glucose homeostasis com- 
pared with WT mice of the same age (Fig. le, f). An analysis of variance 
(ANOVA) test for the area under the curve (AUC) values of the glucose 
tolerance tests indicated no sex-specific effect but showed a significant 
effect of genotype (P = 0.016). Therefore, although SIRT6 overexpres- 
sion had a positive effect on glucose homeostasis in old mice, this 
finding cannot explain the sexual dimorphism in longevity. 

To understand further the mechanisms of the gender-specific life- 
span extension in Sirt6-transgenic mice, we used whole genome micro- 
array analysis to examine differential gene expression in the livers of 
animals of both sexes (Supplementary Table 4). In agreement with the 
sexual dimorphism in liver gene expression”, differential expression 
analysis using Significance Analysis of Microarrays (SAM) software’ 
showed that the most extensive gene expression differences occurred 
between genders (Supplementary Table 5). Notably, significant differ- 
ences were also found between Sirt6-transgenic and WT males, but the 
differences between Sirt6-transgenic and WT females were minor 
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(Supplementary Table 5). ANOVA analysis uncovered a subset of 
genes whose expression differed significantly between genotypes and 
that were gender-specific (Supplementary Table 5). Gene Ontology 
(GO) functional analysis showed that the differentially expressed gene 
set between Sirt6-transgenic males and WT males is significantly 
enriched for categories related to metabolism and cellular responses 
(Supplementary Table 6). We next compared this differentially 
expressed gene set with the set of genes that was differentially 
expressed between male and female WT mice. This analysis revealed 
a significant similarity between the two gene sets. Of the differentially 
expressed genes in Sirt6-transgenic males, 50% (41 of 82) were also 
differentially expressed between male and female WT mice (P = 0) 
(Fig. 2a, b and Supplementary Table 5). 

To confirm the microarray results, 11 of the differentially expressed 
genes in male Sirt6-transgenic mice were selected for validation by 
quantitative PCR. The expression pattern of all of these 11 genes con- 
firmed the microarray data (Figs 2c and 3c). Moreover, to examine 
whether the transcriptional changes due to SIRT6 are mouse-line-spe- 
cific, the expression of several of these genes was followed in another 
transgenic line, and the same pattern of transcriptional changes was 
observed (Supplementary Fig. 1). Calorie restriction and starvation'’®° 
have previously been shown to have a similar effect to SIRT6 overexpres- 
sion on the transcription of several genes (30% of the differentially 
expressed genes in male Sirt6-transgenic mice showed a similar 
expression pattern in male mice fed a calorie-restricted diet'®). For 
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Figure 2 | Expression profile of differentially expressed genes in male Sirt6- 
transgenic mice. a, b, Heat maps displaying the significantly upregulated (red) 
and downregulated (green) genes in Sirt6-transgenic males (m.TG) compared 
with WT males (m.WT). The expression profile of these genes in WT females 
(£WT) or Sirt6-transgenic females (f'TG) compared with WT males is also 
illustrated. Statistical analysis was performed using all 24 arrays. The 
quantitative-PCR-validated genes are shown in bold. c, The relative expression 
levels of hepatic genes were confirmed by quantitative PCR in 20 male and 20 
female mice. The values shown are mean = s.e.m. *, P< 0.05; **, P< 0.01; 
n= 10 per group. RQ, relative quantification. 
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example, the upregulated genes Lpin1, Lpin2, Gadd45g, Fkbp5, Dusp1 
and Cebpd and the downregulated genes Vnn1, Vnn3, Pctp, Vidlr, Car3 
and G0s2 in the expression profile of male Sirt6-transgenic mice are 
also differentially expressed in the livers of mice fed a calorie-restricted 
diet’*"*. 

A key factor in the regulation of lifespan is the IGF1 signalling 
pathway. Worms and flies with a mutated insulin/IGF1 receptor and 
mice that are heterozygous for the IGF1 receptor have an extended 
lifespan’. Moreover, rodents fed a calorie-restricted diet have lower 
IGF] levels early in life than rodents fed a normal chow diet, and many 
rodent genetic models with a prolonged lifespan have lower levels of 
serum IGF1 or IGF] signalling than do control groups*”. Although no 
difference was found between WT and Sirt6-transgenic females, young 
transgenic males (6 months old) had lower serum IGF1 levels than WT 
male littermates (Fig. 3a), and these IGF1 levels in Sirt6-transgenic 
males were similar to those in all females. This significant difference in 
IGF1 levels between young transgenic and WT males was sustained 
until 19 months of age (Fig. 3b). In line with this finding, one of the 
genes that was highly upregulated in Sirt6-transgenic males, to the 
same levels as in WT or Sirt6-transgenic females, was the gene encod- 
ing IGF-binding protein 1 (IGFBP1) (Fig. 3c). IGFBP1 is thought to be 
the main short-term modulator of IGF1 bioavailability*’. Calorie 
restriction increases the expression of IGFBP1 (ref. 17), and high 
levels of IGFBP1 correlate with protection against metabolic 
disorders”. No change was found in the expression of gene encoding 


LETTER 


other IGF1-binding proteins, such as IGFBP3 and acid-labile subunit 
(ALS; also known as IGFALS) (Fig. 3c). 

To follow the changes in IGFI signalling, components of this 
pathway were analysed in the three main metabolic tissues: liver, white 
adipose tissue (WAT) and muscle. Analyses included the phosphor- 
ylation levels of AKT activation sites (Thr 308 and Ser 473), FOXO1 
(Thr 24) and FOXO3 (Thr 32). The most significantly decreased 
phosphorylation levels were observed in the perigonadal WAT of 
Sirt6-transgenic males in comparison to WT males (Fig. 3d—g and 
Supplementary Fig. 2a—d). The levels of phosphorylated AKT (on both 
activation sites), FOXO1 and FOXO3 in WAT were lower in the 
transgenic mice (Fig. 3e, f). Therefore, we further explored this 
pathway in WAT and found that the phosphorylation levels of the 
IGF1 receptor (Tyr 1135) and S6 (Ser 235/236) were lower in Sirt6- 
transgenic males than in the WT male littermates (Fig. 3d, g). 
Importantly, no significant change in the phosphorylation levels of 
these markers was observed in female mice (Fig. 3d-g and Supplemen- 
tary Fig. 2a—d). Moreover, the decrease in the phosphorylation levels of 
AKT and FOXO proteins in male Sirt6-transgenic mice is in agree- 
ment with previous reports that show that lifespan is positively regu- 
lated by changes in IGF1 signalling in the whole organism, or 
specifically in the fat tissues, of nematodes and fruitflies*”’. 

There is much doubt about whether mammalian sirtuins regulate 
lifespan*°*. Moreover, in the fly and nematode, a recent study challenged 
the role of sirtuins in regulating lifespan, claiming that the increased 
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Figure 3 | Alterations in the IGF1-AKT pathway in Sirt6-transgenic males. and FOXO3 at Thr 32 (f), and S6 at Ser 235/236 (g) in perigonadal WAT (n = 4 


a, b, Serum IGF] levels in male and female WT and Sirt6-transgenic mice at 6 
months (a) and 19 months (b) of age (n = 4-7). ¢, The relative expression of 
hepatic Igfbp1, Igfbp3 and Als measured by quantitative PCR (n = 4-7). 

d-g, The phosphorylation levels of the IGF1 receptor (IGF1R) at Tyr 1135 
(d), AKT at both the Thr 308 and Ser 473 activation sites (e), FOXO1 at Thr 24 


mice per genotype). All mice were killed at the same time of day. The 
phosphorylated to unphosphorylated protein ratios, as determined by 
densitometry, are shown on the right. a~g, The values shown are the 
mean + s.e.m. *, P< 0.05; **, P< 0.01. 
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longevity observed in strains with SIR-2 overexpression is caused by 
differences in genetic background or by mutagenic effects of transgene 
insertion’. To address potential complications owing to strain-specific 
effects and integration sites, we used a segregating background with 
equal contributions from the C57BL/6J and BALB/cOlaHsd mouse 
strains and studied two separate lines. Indeed, we showed that SIRT6 
extends male lifespan regardless of the integration site (Supplementary 
Fig. 3) and in two control lines with different lifespans. Here, we reveal 
a role for the mammalian sirtuin SIRT6 in regulating lifespan. SIRT6 
overexpression extends lifespan only in males, potentially by reducing 
IGF1 signalling specifically in WAT. Mice with a fat-specific insulin 
receptor gene knockout have been shown to have an increased mean 
lifespan of similar magnitude to the male transgenic mice in our 
study’, demonstrating the central role of fat in regulating lifespan. 
Most genetic modifications of the IGF1 or insulin signalling pathway 
affect the lifespan of both genders or show a stronger effect in females. 
Yet here the effect of SIRT6 on IGF1 signalling was male specific. 
Therefore, further research is required to determine whether the effects 
of SIRT6 are blocked in females rather than enhanced in males. Taken 
together, our findings suggest that SIRT6 is an important regulator of 
mammalian longevity and indicate the feasibility of manipulating 
SIRT6 levels to treat age-related diseases. 


METHODS SUMMARY 


Sirt6-transgenic mice on the CB6F1 background, containing equal contributions 
from C57BL/6) and BALB/cOlaHsd mouse strains, were generated as described 
previously’, and the glucose tolerance tests and lifespan analyses were performed 
as described previously****. Tissues were taken after natural death, fixed in 
formaldehyde for histopathological analysis, embedded in paraffin, sectioned, 
and stained with haematoxylin and eosin. Quantitative PCR was performed using 
ABsolute Blue SYBR Green on a StepOnePlus instrument. Microarray sample 
labelling and hybridization were performed as previously described’, and data 
were normalized using the program dChip. Differentially expressed genes were 
identified using SAM and defined as those with a q value of <10.0% and a 
minimum of a 1.5 fold change. 
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Stability criteria for complex ecosystems 


Stefano Allesina’? & Si Tang! 


Forty years ago, May proved’ that sufficiently large or complex 
ecological networks have a probability of persisting that is close to 
zero, contrary to previous expectations’ *. May analysed large 
networks in which species interact at random’”*. However, in 
natural systems pairs of species have well-defined interactions 
(for example predator-prey, mutualistic or competitive). Here 
we extend May’s results to these relationships and find remarkable 
differences between predator-prey interactions, which are stabil- 
izing, and mutualistic and competitive interactions, which are 
destabilizing. We provide analytic stability criteria for all cases. 
We use the criteria to prove that, counterintuitively, the probability 
of stability for predator-prey networks decreases when a realistic 
food web structure is imposed’* or if there is a large preponderance 
of weak interactions””®. Similarly, stability is negatively affected by 
nestedness'’"“* in bipartite mutualistic networks. These results are 
found by separating the contribution of network structure and 
interaction strengths to stability. Stable predator-prey networks 
can be arbitrarily large and complex, provided that predator-prey 
pairs are tightly coupled. The stability criteria are widely applicable, 
because they hold for any system of differential equations. 

May’s theorem deals with community matrices'** M, of size S X S, 
where S is the number of species. Mj describes the effect that species j 
has on i around a feasible equilibrium point (that is, species have 
positive densities) of an unspecified dynamical system describing the 
species’ densities through time. 

In May’s work!?, the diagonal coefficients are —1, and the off- 
diagonal coefficients are drawn from a distribution with mean 0 and 
variance a” with probability C and are 0 otherwise. For these matrices, 
the probability of stability is close to 0 whenever the ‘complexity’ 
oV/SC>1. Local stability measures the tendency of the system to 
return to equilibrium after perturbations. In unstable systems, even 
infinitesimal perturbations cause the system to move away from 
equilibrium, potentially leading to the loss of species. Thus, it should 
be extremely improbable to observe rich (large S) or highly connected 
(large C) persistent ecosystems'”. Mathematically, an equilibrium 
point is stable if all the eigenvalues of the community matrix have 
negative real parts’. 

Local stability can only describe the behaviour of the system around 
an equilibrium point, whereas natural systems are believed to operate 
far froma steady state>'>. However, methods based on local stability are 
well suited to the study of large systems’'*'’, whose empirical para- 
meterization would be unfeasible. Moreover, the methods are general, 
so that they can be applied to any system of differential equations. 

May’s matrices have random structure: each pair of species interacts 
with the same probability. However, this randomness translates, for 
large S, into fixed interaction frequencies, so that these matrices follow 
a precise mixture of interaction types. For example, in May’s matrices 
predator-prey interactions are twice as frequent as mutualistic ones 
(Supplementary Table 1). Here we extend May’s work to different 
types of interaction, starting from the random case. 

Suppose that two species j and i interact with probability C; and that 
the interaction strength is drawn from a distribution: Mj takes the 
value of a random variable X with mean E(X)=0 and variance 


Var(X) = o*. The diagonal elements of the community matrix, repre- 
senting self-regulation, are set to —d. For large systems, the eigenvalues 
are contained in a circle’* in the complex plane (Fig. 1 and Supplemen- 
tary Information). The circle is centred at (—d,0) and the radius is 
oV/SC. In stable systems, the whole circle is contained in the left half- 
plane (that is, all eigenvalues have negative real parts). Thus, the 
system is stable when the radius is smaller than d: SC <0=d/c. 

In predator-prey networks, interactions come in pairs with opposite 
signs: whenever M, a then Mii <0. With probability C, we sample one 
interaction strength from the distribution of |X| and the other from —|X|, 
whereas with probability (1 — C) both are zero. The eigenvalues of large 
predator-prey matrices are contained in a vertically stretched ellipse’’, 
centred at (—d, 0), with horizontal radius oVSC (1 —F (|x |) / o ) and 
thus the stability criterion is VSC<0/(1 —F(|X|)/o ) (Fig. 1 and 
Supplementary Information). 

When we constrain My and Mji to have the same sign, and thus 
impose a mixture of competition and mutualism with equal probability, 
the eigenvalues are enclosed in a horizontally stretched ellipse’? and 
the criterion becomes VSC <0/(1+E?(|X|)/o?) (Fig. 1 and Sup- 
plementary Information). 

Take C = 0.1, X ~ N(0, 1/4) (that is, X follows a normal distribution 
with mean 0 and variance 1/4), and d=1. The criterion becomes 
JSC <2 for random matrices, and is violated whenever S = 41. For 
predator-prey we find WSC <2n/(n—2) (violated for S = 303) and 
for the mixture of competition and mutualism SC <2n/(n+2) 
(violated for $= 15). Since E(|X|)/o>0 for any distribution of X, 
the stability criteria form a strict hierarchy in which the mixture matrices 
are the least likely to be stable, the random matrices are intermediate, 
and the predator-prey matrices are the most likely to be stable (Fig. 2 
and Table 1). Considerations based on qualitative stability? and 
numerical simulations’® are consistent with this hierarchy. 

In the three cases above, the mean interaction strength is zero, and 
the coefficients come from the same distribution. In fact we can shuffle 
the interaction strengths, thereby transforming a network of one type 
into another: the difference in stability is driven exclusively by the 
arrangement of the coefficients in pairs with random, opposite and 
same signs, respectively. This feature allows us to further derive the 
stability criteria for all intermediate cases by using linear combinations 
of the three cases above (Supplementary Information). 

Two ecologically important cases, however, cannot produce a mean 
interaction strength of zero. In mutualistic networks all interactions 
are positive, whereas in competitive networks they are negative. In 
these cases, for large systems, all the eigenvalues except one (equal to 
the row sum) are contained in an ellipse (Fig. 3 and Supplementary 
Figs 1 and 2). In mutualistic networks in which all interaction pairs are 
positive and drawn from the distribution of |X| independently with 
probability C, the stability criterion becomes (S—1)CE(|X|)/o0<0 
(that is, row sum < 0; Supplementary Information). For competitive 
matrices, in which interaction pairs are drawn from the distribution of 
—|X| with probability C, the criterion is 


VSC(1+(1—2C) Exp/e*) / 1—CE*(|X|) /o* + CE(|X|)/0 <0 
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Figure 1 | Distributions of the eigenvalues and corresponding stability 
profiles. a, For X ~ N(0, a’), S= 250, C= 0.25 and o = 1, we plot the 
eigenvalues of 10 matrices (colours) with —d = —1 on the diagonal and off- 
diagonal elements, following the random, predator-prey or mixture 
prescriptions. The black ellipses are derived analytically in the text. 

b, Numerical simulations for the corresponding stability profiles. For the 
random case, starting from S = 250, C= 0.5, g = 0.1 and d= 1, we 
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Figure 2 | Stability criteria for different types of interaction. We fixed 

0 = d/o = 4, and for a given connectance C we solved for the largest integer S 
that satisfies the stability criterion for each type of interactions. Combinations 
of S and C below each curve lead to stable matrices with a probability close to 1. 
The interaction types form a strict hierarchy from mutualism (most unlikely to 
be stable) to predator-prey (most likely to be stable). 
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systematically varied C (crosses) or (plus signs) to obtain «SC spanning 
(0.5, ...,1.0,..., 1.5] of the critical value for stability (indicated in red, 1 in the 
case of random matrices). The profiles were obtained by computing the 
probability of stability out of 1,000 matrices. The predator-prey case is as the 
random but with o = 0.5 and critical value m/(m — 2). The mixture case is as the 
random but with critical value m/(m + 2). In all cases, the phase transition 
between stability and instability is accurately predicted by our derivation. 


(Supplementary Information). In both cases, stability decreases rapidly 
with higher complexity, and mutualistic matrices are less likely to be 
stable than their competitive counterpart (Fig. 2 and Table 1). 

Having derived the stability criteria, we want to assess the effect of 

imposing realistic food web structure within the predator-prey case. It is 
believed that realistic food web structures should improve stability’*"”. 
In community matrices of food webs, producers have positive columns 
and negative rows, with the opposite for top predators. To test whether 
these variations affect stability, we plotted the eigenvalues for predator- 
prey webs in which interactions are arranged, following the cascade” 
and niche*' models. Imposing realistic structures results in eigenvalues 
with larger real parts than the corresponding unstructured predator- 
prey case (Supplementary Information and Supplementary Fig. 3). 
Thus, the cascade and niche models produce networks that are less likely 
to be stable than their unstructured predator-prey counterpart, with the 
niche model having a larger discrepancy: imposing realistic food web 
structure hampers stability. 

Similarly, we measured the effect of realistic structures on mutualistic 
networks. Several published mutualistic networks are bipartite’™™: 
there are two types of node (for example plants and pollinators), and 
interactions occur exclusively between different types. In addition, 
bipartite mutualistic networks tend to be nested"’: the interactions of 
the specialists form a subset of those of the generalists. Nestedness is 
believed to beget stability'” *. We plotted the eigenvalues for these two 
types of structure and compared the results with those obtained for the 
unstructured mutualistic case (Fig. 3, Supplementary Information and 
Supplementary Fig. 4). As stated above, stability in mutualistic networks 
is determined by the row sum. The bipartite case yields row sums that, 
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Table 1 | Stability criteria for different types of interaction and network structure 


Sinax(C, 0) 
nteraction Stability criterion (0.1, 2.0) (0.1, 4.0) (0.2, 4.0) 
ested mutualism 9 28 18 
utualism 16(15 41 (51 22 (20 
(s-1)c/2<0 (15) (51) (20) 
Bipartite mutualism 17 41 23 
ixture VEC <_ " . 17 (14) 58 (59) 33 (29) 
Competition 1715 62 (63 38 (33 
p Jse(14.2.— 2), /* 2 .cf2<6 (15) (63) (33) 
m — 2C T 
Random VSC <0 50 (40) 168 (160) 88 (80) 
iche predator-prey 149 461 245 
Cascade predator-prey 298 1,134 535 
P. _ 
redator-prey VEC <_ " 5 314 (302) 1,201 (1,211) 603 (605) 


nall cases, the criterion is derived for large S x S matrices with X ~ N(O, «?) (and thus E(|X|) =o/ 2/m), connectance C and @ = d/o. Numerical simulations report, for a given combination of C and 6, the largest S 


(Smax) yielding a probability of stability = 0.5 (computed using 1,000 matrices). In parenthesis are the analytical predictions. 


for large S, are equal to the unstructured case. Accordingly, we did not 
find a discrepancy in stability for the bipartite case. However, in nested 
structures some rows and columns have sums that are larger than 
average (generalist plants and animals). Consequently, nested matrices 
are inherently less likely to be stable than unstructured ones. These 
findings are confirmed by numerical simulations. Using the same 
method, we found that asymmetric coupling of interaction strengths 
(where each large Mj is coupled with a small Mj;), contrary to current 
expectations”, does not influence stability in mutualistic networks 
(Supplementary Information and Supplementary Fig. 5). 

We have considered how the arrangement of the interactions affects 
stability, and have found several counterintuitive results. These results 
can be accounted for by the fact that we provide a very conservative test 
for the effects of structure on stability (Supplementary Information). 
We now assess the role of the magnitude of interaction strengths. In 
fact, our findings extend to any distribution of coefficient strengths 
(Supplementary Information). 

Typically, ecologists have regarded o as the ‘average interaction 
strength’. However, o does not provide information on weak inter- 
actions”'®”: we can have the same g for two distributions with distinct 
shapes, and thus different proportions of weak and strong interactions 
(Supplementary Information). We analyse how the shape of the dis- 
tribution affects stability for fixed S, C, d and o. If the distribution 
contains many weak interactions, the expected magnitude E(|X|)~0. 
In contrast, if weak interactions are rare, E(|X|)~o. In the predator- 
prey systems, lowering E(|X|) decreases 0/ (1— E*(|X|)/ a”) and thus 


hampers stability. We conclude that weak interactions, contrary to 
current beliefs”’*”’, can destabilize predator-prey systems. Weakening 
the interactions shifts E(|X|) closer to zero and therefore makes 
predator-prey systems closer to their random counterpart. With the 
same argument, weak interactions can stabilize the mixture of competi- 
tion and mutualism case and have no effect on random networks. 
Variability in interaction strengths was previously found to be 
detrimental for stability in large food webs”’ and competitive networks’”. 

For example, consider a uniform distribution X ~ Ul —o 3,03] 
and contrast it with the normal case X ~ N(0, 0”). Both parameteriza- 
tions lead to E(X)=0 and Var(X)=o7. In the uniform case, 


E(|X|) =o/3/2~0.8660, whereas in the normal case E(|X|) = 


oy/2 / 1t~0.798 6. This means that the uniform distribution, on 
average, leads to stronger interactions than the corresponding normal 
case. In turn, this has a large effect on stability: the criterion for the 
predator-prey case becomes /SC<40 for the uniform distri- 
bution, whereas it is WSC <1/(n—2)0~2.75 0 for the normal case. 
The random case is unaffected by the choice of the distribution 
(\/SC <0), whereas in the mixture of competition and mutualism 
we have VSC<40/7~0.5710 for the uniform distribution and 
VSC <10/(m+2)~0.61 0 for the normal case. These considerations 
extend to any choice of distribution for the interaction strengths 
(Supplementary Information and Supplementary Figs 6 and 7): weak 
interactions, all other things being equal, are destabilizing for food 
webs, stabilizing for mutualistic and competitive networks (and their 
mixture), and have no effect on random networks. 
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Figure 3 | Distribution of the eigenvalues for the three types of mutualism. 
a, Unstructured mutualism. b, Bipartite mutualism. c, Nested and bipartite 
mutualism. In all cases, S = 250, o = 0.1, C= 0.2 and d= 1. Note that the 
bipartite case does produce extreme negative real eigenvalues (green arrow) 
coupled with positive ones, but the row sum (and thus the rightmost eigenvalue, 


red arrow) is equal to that of the unstructured mutualistic case. The nested 
matrices, in which generalist species yield (on average) larger row and column 
sums, have larger rightmost eigenvalues. Thus, highly nested matrices are less 
likely than the other two cases to be stable. 
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We have derived stability criteria for unstructured networks in 
which species interact at random, in predator-prey, mutualistic, and 
competitive pairs. These results hold for arbitrary diagonal values and 
arbitrary distribution of interaction strengths (Supplementary 
Information). Our analysis shows that, all other things being equal, 
weak interactions can be either stabilizing or destabilizing depending 
on the type of interactions between species. In predator-prey systems, 
realistic structure and weak interactions are detrimental for stability. 
However, in natural food webs, which seem to persist in time, weak 
interactions are preponderant™*. The persistence of these networks 
might be explained by the interplay between their structure and weak 
interactions, even though each would be destabilizing if taken in 
isolation. For example, as suggested previously’, generalist predators 
could have weak interactions with their numerous prey, reducing the 
effect of the realistic structure and driving the system closer to the 
unstructured case. 

Predator—prey systems differ markedly from the other cases studied 
here. Suppose that a network is unstable. The system can be stabilized 
either by lowering C, S or o (decreasing its complexity), or by increas- 
ing the self-regulation d. This is in line with May’s argument: large and 
highly interconnected systems are difficult to stabilize. For random 
networks, reducing complexity is the only way to stabilize the system. 
However, in the other cases, networks can be stabilized by altering the 
distribution of interaction strengths; by modifying the parameters of 
the system we can typically change the distribution of the off-diagonal 
elements without altering the diagonal ones (Supplementary Informa- 
tion). For competition, mutualism and their mixture, stability is 
achievable by decreasing the average interaction strength E(|X|), 
which is akin to lowering complexity. On the contrary, predator-prey 
networks can be stabilized by increasing the strength of interaction 
E(|X|), and thus the coupling between predators and prey. Predator- 
prey systems are therefore the only ones that can potentially elude 
May’s conclusions'” and support an arbitrarily large, complex and 
stable ecological network. 

Our results show that the ubiquity of consumer-resource relation- 
ships in nature could be due to their intrinsic dynamical properties. 
These findings are not limited to ecological networks, but instead hold 
for any system of differential equations resting at an equilibrium point. 
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Gain control by layer six in cortical 


circuits of vision 


Shawn R. Olsen!*, Dante S. Bortone!*, Hillel Adesnik! & Massimo Scanziani! 


After entering the cerebral cortex, sensory information spreads through six different horizontal neuronal layers that are 
interconnected by vertical axonal projections. It is believed that through these projections layers can influence each 
other’s response to sensory stimuli, but the specific role that each layer has in cortical processing is still poorly 
understood. Here we show that layer six in the primary visual cortex of the mouse has a crucial role in controlling the 
gain of visually evoked activity in neurons of the upper layers without changing their tuning to orientation. This gain 
modulation results from the coordinated action of layer six intracortical projections to superficial layers and deep 
projections to the thalamus, with a substantial role of the intracortical circuit. This study establishes layer six as a 
major mediator of cortical gain modulation and suggests that it could be a node through which convergent inputs 
from several brain areas can regulate the earliest steps of cortical visual processing. 


Primary sensory areas in the cerebral cortex are composed ofa stack of 
six neuronal layers’. Anatomical and physiological data indicate that 
these layers are interconnected through vertical excitatory axons*°, 
suggesting that sensory processing in any given layer may be modu- 
lated by activity in several other layers. However, so far the exact 
contribution of each layer to cortical processing is unclear. 

Here we address the role of layer six (L6) in mouse visual cortex, 
whose excitatory neurons not only project to more superficial layers 
but also to the primary sensory thalamic nuclei*”""’, the main source 
of sensory input to the cortex (Fig. 1a). L6 may thus influence cortical 
sensory responses directly through intracortical projections and 
indirectly through corticothalamic projections. Corticothalamic pro- 
jections were reported to be both suppressive and facilitatory on 
thalamic activity, depending on the precise alignment between L6 
and thalamic neurons (for reviews see refs 12-16). By contrast, how 
sensory responses in cortex are affected by L6 activity has remained 
largely unexplored’”"*. Furthermore, the relative contribution of 
intracortical versus corticothalamic projections in modulating cor- 
tical responses is currently unknown. The paucity of information is 
due to the lack of experimental tools for selectively manipulating 
activity in L6 without directly perturbing other cortical layers. 


L6 neurons of the Ntrsl-Cre GN220 line 


To control the activity of L6 we took advantage of a Cre-recombinase 
Bac transgenic mouse line that is reported to selectively label L6 neurons 
(NTSR1-Cre GN220)”. In the forebrain of these mice Cre expression 
was restricted to excitatory L6 neurons of the cerebral cortex (Fig. 1b 
and Supplementary Fig. 1). In primary visual cortex (V1) these neurons 
represented ~65% of the L6 excitatory neuronal population and, con- 
sistent with classification of L6 neurons in this region’, could be sub- 
divided into two morphologically distinct categories: those whose 
apical dendrites ended in L4 and those that extended to L1 (Fig. 1b 
and Supplementary Fig. 1g, h). Furthermore, consistent with the cor- 
ticothalamic projections originating from L6 in V1 (ref. 8), Cre- 
expresssing neurons projected to the dorsolateral geniculate nucleus 
(dLGN; the primary thalamic visual nucleus) and the nucleus reticularis 
thalami (NRT; the main thalamic inhibitory nucleus) (Fig. 1b and 


Supplementary Fig. 1d, e). Thirty-five percent of L6 excitatory neurons 
in V1 did not express Cre and these were morphologically distinct from 
the Cre-expressing population (Supplementary Fig. 1g). 

To manipulate the activity of L6 neurons we conditionally 
expressed the light-sensitive cation channel channelrhodopsin 2 
(ChR2)*°”* in V1 using viral injection into NTSR1-Cre mice (Sup- 
plementary Fig. 2a). A linear multichannel probe recorded the spiking 
activity of neuorns located across the vertical depth of cortex. Light- 
emitting diode (LED) illumination of the cortical surface for 500 ms 
with blue light (470 nm) increased the activity of L6 neurons in V1 of 
anaesthetized animals (Fig. lc-e and Supplementary Fig. 2b). This 
increase was not due to direct stimulation of the retina by the LED as it 
was absent in uninjected animals (Supplementary Fig. 2g). 


L6 activity suppresses other layers 


To determine how L6 activation affects visual responses in other 
layers, we presented drifting gratings, and alternated control trials 
(visual stimulus only) with trials in which L6 was photostimulated 
(Fig. 1c). Notably, photostimulation of L6 rapidly and reversibly 
suppressed visually evoked multi-unit activity throughout the depth 
of the cortex (Fig. 1d). L6 photostimulation also reduced spontaneous 
activity (Supplementary Fig. 3d, e). This effect was absent in uninjected 
animals (Supplementary Fig. 2g). The suppressive action of L6 was 
similar across L2/3, L4 and L5 and was monotonic (Fig. 1e,f): that is, 
increasing L6 activity by increasing illumination intensity progres- 
sively suppressed visual responses, eventually abolishing nearly all 
evoked activity (strongest illumination reduced activity by 81 + 5%, 
84 + 3%, and 84 + 3% for L2/3, L4 and LS, respectively; P < 10°). 
Because multi-unit activity is dominated by neurons with high firing 
frequencies, we determined the effect of L6 photostimulation on iso- 
lated single units whose average visually evoked firing rate varied over a 
20-fold range. Isolated units were suppressed by L6 photostimulation 
(Fig. 1g), irrespective of their firing rates (Fig. 1h; 91.1% of units were 
suppressed and 7.8% were facilitated, and all facilitated units were 
fast-spiking, putative inhibitory cells (Supplementary Fig. 4a-d). 
Furthermore, in the same way as for multi-unit activity, L6 photosti- 
mulation monotonically suppressed single units (Fig. 1i, j; strongest 
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Figure 1 | Photostimulation of L6 suppresses visual responses in the other 
layers. a, Schematic of L6 projections. Red triangle represents an L6 pyramidal 
neuron. b, Left, coronal section of V1 from an NTSR1-Cre, floxed-tdTomato, 
GAD67-GFP mouse. Inset, L6 projection to dLGN (V1 of NTSR1-Cre mouse 
was injected with floxed-tdTomato virus). Scale bar, 250 tum (125 kum for inset). 
Right, the two types of L6 neurons that are labelled by the NTSR1-Cre line. 
Black, dendrites; grey, axons. c, Schematic of experimental setup. Rec., 
recording probe. d, Cortical visual responses with (blue) and without (black) L6 
photostimulation. Left, raster plot of multi-unit activity grouped by depth. 
Control and photostimulation trials were interleaved but are separated here for 


illumination reduced activity by 91 + 4%, 93 + 2%, and 92 + 2% for 
12/3, L4 and LS, respectively; P < 10 °). Thus, these data show that 
stimulation of L6 excitatory neurons suppresses visually evoked res- 
ponses in L2/3, L4 and L5 of V1. 


L6 activity does not affect tuning 


Like in other mammals, neurons in mouse V1 differentially respond to 
gratings of different orientations”. We determined whether L6 
stimulation affects the orientation tuning of V1 neurons. We generated 
tuning curves by presenting gratings drifting in 8-12 different direc- 
tions and alternated control trials with trials in which L6 was photo- 
stimulated (Fig. 2a, b). We used alow LED intensity to suppress cortical 
visual responses partially, and considered units that were suppressed 
by between 10% and 75% (average suppression 42 + 3%, n = 55). 
Tuning curves of individual, isolated units were averaged into a popu- 
lation tuning curve (Fig. 2b, d; see methods). Remarkably, photo- 
stimulation of L6 resulted in the precise scaling of the tuning curve; 
that is, it reduced visually evoked responses by a similar fraction 
irrespective of presented orientation. This is clearly illustrated by 
plotting the normalized firing rates of the population tuning curve 
under control versus L6 photostimulation conditions (Fig. 2e). The 
data points fit well with a line whose slope is 0.56 and intercepts the 
y axis close to the origin. Thus, photostimulation of L6 did not affect 
preferred orientation, tuning width or the orientation selectivity index 
(OSD of cortical neurons throughout L2/3, L4 and L5 (Fig. 2c; for L2/3, 
L4andL5, respectively, the mean change in preferred orientation was 3 
+ 3° (P=0.22),0 + 5° (P=0.9) and —4 + 5° (P = 0.48), mean change 
in tuning width was —1 + 4° (P = 0.8), 6 + 4° (P = 0.15) and —6 + 6° 
(P = 0.3), and mean change in OSI was —0.09 + 0.07 (P = 0.23), 
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clarity. Black bar, visual stimulus (1.5 s); blue bar, LED illumination (0.5 s). 
Right, normalized (Norm.) peristimulus time histogram (PSTH); top, upper 
layers; bottom, L6. e, Summary (n = 6 experiments). The control is shown in 
black and increasing LED intensities in darker blues. f, Suppression of multi- 
unit activity with increasing Lé6 activity. g, Visual response of a single L4 unit 
with (blue) and without (black) L6 photostimulation. Scale bar, 20 spikes per s. 
h, Response of each regular spiking unit with and without strong 
photostimulation of L6. i, Average normalized PSTH (n = 47 units tested with 
5 LED intensities). Colours are the same as in e. j, Suppression of single-unit 
activity. Error bars, mean = s.e.m. 


0.7 + 0.04 (P = 0.14), —0.06 + 0.05 (P = 0.22)). L6 photostimulation 
also resulted in a scaling of V1 responses to stimuli of increasing 
contrast (the contrast response function; Supplementary Fig. 5b). 
These data demonstrate that in primary visual cortex L6 selectively 
controls the gain of cortical responses to visual stimuli. 

A potential concern in stimulating L6 with ChR2 is that the 
spatially uniform activation and the temporal pattern generated in 
L6 neurons may differ from visually evoked activity patterns, and thus 
the physiological activity of L6 neurons and L6 photostimulation may 
affect cortical activity in different ways. Furthermore, anaesthesia may 
change the impact of L6 on cortical responses to sensory stimuli. To 
address these issues, we optogenetically suppressed visually evoked 
activity in L6 in awake animals and determined the resulting effect on 
more superficial layers (Supplementary Fig. 6). Animals were head 
fixed but otherwise kept unrestrained on a passive circular treadmill 
(see Methods). L6 activity was suppressed using conditionally 
expressed light-sensitive hyperpolarizing opsins archeaerhodopsin™* 
and halorhodopsin 3.0 (NpHR3.0) (ref. 25). LED illumination with 
amber light (590 nm), although reducing visually evoked L6 activity 
by ~30% (Supplementary Fig. 6e), significantly facilitated visual res- 
ponses of isolated units throughout the other layers (Fig. 2f, g and 
Supplementary Fig. 6). The facilitation was not due to direct LED 
illumination of the retina, as it was absent in uninjected animals 
(Supplementary Fig. 6f). Thus, suppression of L6 facilitates visually 
evoked activity in L2/3, L4 and L5, indicating that even physiologically 
generated L6 activity exerts a suppressive action onto these layers. 
Furthermore, suppression of L6 resulted in the precise scaling of the 
tuning curve (for the tuning curve analysis we considered units that 
were facilitated by at least 10% (average facilitation 41 + 7%, n = 52)). 
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Figure 2 | L6 bidirectionally modulates the gain of visual responses without 
altering tuning. a, Visual responses of an L5 neuron with (blue) and without 
(black) L6 photostimulation. Raster plots and peristimulus time histograms for 
two out of eight tested visual stimulus directions. Scale bar, 40 spikes per s. 

b, Tuning curves for the neuron in (a). c, The OSI for each neuron with and 
without photostimulation of L6. d, Population tuning curve with (blue) and 
without (black) L6 photostimulation (n = 55). Black curve, fit using the sum of 
two Gaussians; blue curve, black curve scaled by the slope of linear fit in 

e. e, Control response plotted against response with L6 photostimulation (data 
from c). Blue, linear fit (7 = 0.98). Black bar, visual stimulus (1.5 s); blue bar, 


The plot of normalized firing rates under control versus L6 photo- 
suppression conditions was well fit by a line whose slope is 1.4 and 
intercepts the y axis very close to the origin (Fig. 2j). Consistent with 
this, suppressing L6 did not affect preferred orientation, tuning width 
or orientation selectivity (Fig. 2h; for L2/3, L4 and LS, respectively, the 
mean change in preferred orientation was 2 + 3° (P = 0.41), 0 + 2° 
(P = 0.95) and —4 + 4° (P=0.35) degrees, mean change in tuning 
width was —2 + 4° (P=0.68), 0 + 3° (P= 0.94) and —1 + 4° 
(P= 0.77) degrees, and mean change in OSI was —0.01 + 0.03 
(P = 0.22), 0.02 + 0.02 (P= 0.50) and —0.03 + 0.03 (P= 0.22)). 
Taken together, these results demonstrate that visually driven L6 
activity in awake animals controls the gain of cortical responses to 
visual stimuli. 


L6 intracortical and subcortical pathways 


Two pathways could potentially mediate the suppression exerted by 
L6 on cortical activity. On one hand, L6 neurons project to the 
thalamus, where they can influence visually generated activity before 
it even reaches the cortex. On the other hand, L6 neurons also project 
to more superficial layers where they could directly modulate cortical 
activity. We addressed the impact of both projections. We performed 
extracellular recordings from the dLGN while photostimulating L6 in 
V1 (Fig. 3a). (LGN relay neurons were identified based on their visual 
response properties and characteristic firing pattern (Supplemen- 
tary Fig. 7d). Photostimulation of L6 led to a rapid, reversible and 
monotonic reduction of visually evoked activity in dLGN relay 
neurons (Fig. 3b, c; strongest illumination: 76 + 4% reduction; 
P<10 % n= 32), without, however, markedly modifying their 
firing mode (burst prevalence: 12 + 6% in control; 6 + 3% after 
reducing dLGN activity by 30% with L6 photostimulation, P = 0.08; 
Supplementary Fig. 7e, f). This indicates that L6é stimulation suppresses 


Norm. rate LED off 


Direction (deg) 


LED illumination (0.5 s). f, Visual response of an L4 neuron with (orange) and 
without (black) L6 photosuppression. Scale bar, 50 spikes per s. Orange bars, 
illumination with an amber-coloured LED (1.95 s); black bar, visual stimulation 
(1.5 s). g, Tuning curves for neuron in (f). h, OSI for each isolated unit with and 
without photosuppression of L6. i, Population tuning curves with and without 
L6 photosuppression (n = 52). Black curve, fit using sum of two Gaussians; 
orange curve, black curve scaled by slope of linear fit in j. j, Control response 
plotted against response with L6é photostimulation (data from i). Orange, linear 
fit (7 = 0.92). Error bars, mean + s.e.m. 


dLGN activity. To test whether visually evoked activity in L6 also 
suppresses dLGN activity we silenced the cortex optogenetically (by 
photostimulating parvalbumin-expressing inhibitory neurons in V1 
with ChR2; see Methods and Supplementary Fig. 8). Consistent with 
the suppressive action of L6 stimulation on dLGN, silencing the cortex 
strongly facilitated dLGN activity (Fig. 3d-f; average facilitation 
87 + 25% (P = 0.002, n = 18)). In vitro recordings demonstrated that 
the suppressive action of L6 was due to the generation of disynaptic 
inhibition onto dLGN relay neurons, at least in part through the 
recruitment of NRT inhibitory neurons (and possibly through the 
recruitment of local inhibitory neurons in dLGN*°) (Supplementary 
Fig. 9). Thus, these results reveal that L6 can effectively suppress visual 
responses in the dLGN. 

If L6 suppresses cortical visual responses indirectly, by suppressing 
the dLGN, this suppression should precede V1 suppression by a few 
milliseconds. We tested this prediction by performing simultaneous 
recordings from both dLGN and V1 and compared the onset of sup- 
pression in these two structures upon L6 photostimulation (Fig. 3g). 
Surprisingly, cortical suppression preceded dLGN suppression by a 
few milliseconds (Fig. 3h). This result suggests that L6 activity may 
suppress cortical visual responses through an alternative circuit. 
Because L6 neurons send axons to the upper layers of cortex we tested 
whether these projections can suppress cortical activity independently 
of the corticothalamic projections. For this, we performed in vitro 
whole-cell recordings from neurons in L2/3, L4, L5 and Lé6 in coronal 
slices of V1 (Fig. 4a); this slicing plane disconnects V1 from dLGN. 

Photostimulation of L6 in vitro generated both excitatory and 
inhibitory postsynaptic currents (EPSCs and IPSCs, respectively) 
onto L2/3, L4, L5 and L6 pyramidal cells (L6 recordings included only 
those pyramidal cells not expressing ChR2) (Fig. 4b). IPSCs were of 
disynaptic (or polysynapyic) origin as they were entirely blocked by 
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Figure 3 | Photostimulation of L6 suppresses cortex faster than it 
suppresses dLGN. a, Schematic of the experimental setup. b, Visual response 
of dLGN unit with (blue) and without (black) L6 photostimulation. Scale bar, 
20 spikes per s. Black bar, visual stimulus (1s); blue bar, LED illumination 
(0.5 s). ¢, Average response of each dLGN unit with and without L6 
photostimulation. Inset, monotonic suppression of dLGN. d, Schematic of 
setup for silencing V1 by photostimulation of parvalbumin inhibitory neurons. 
e, Visual response of dLGN unit with and without photo-silencing of V1. Scale 
bar, 30 spikes per s. Black bar, visual stimulus (1 s); blue bar, LED illumination 
(0.5 s). f, Average response of each dLGN unit with and without cortical 
silencing. g, Schematic of experimental setup. h, Left, time-course of L6- 
mediated suppression of dLGN (grey) and V1 (black) (n = 4). Residual 
response during maximal suppression set to zero (see Methods). Bin size, 3 ms. 
Right, the same data on an expanded timescale. The first bin at LED onset was 
blanked to remove LED-induced artefact. Inset, time to suppression exceeding 
two standard deviations from baseline activity in dLGN and V1 for four 
experiments (P = 0.012). Error bars, mean + s.e.m. Inset, y-axis units are ms. 


glutamatergic antagonists (Supplementary Fig. 10b). Furthermore, 
the activity pattern generated by L6 photostimulation was similar to 
the activity pattern generated in vivo (Supplementary Fig. 2b, h). 
IPSCs were larger than EPSCs, despite the fact that both currents were 
recorded with a similar driving force (IPSCs were recorded near the 
reversal potential for EPSCs and vice versa). Indeed, excitatory charge 
accounted for only 10% or less of the total charge, depending on the 
layer (Fig. 4c) or sublayer (Supplementary Fig. 10c, d). These results 
show that V1 contains the necessary circuitry for L6 to generate 
disynaptic inhibition onto L2/3, L4, L5 and onto itself. 

To determine whether L6 can suppress neuronal spiking across 
L2/3, L4, L5 and L6 through these disynaptic IPSCs, we performed 
current-clamp recordings in the perforated patch configuration (to 
preserve the intracellular ionic composition) and triggered spiking by 
injecting depolarizing current pulses. Photostimulation of Lé6 signifi- 
cantly suppressed firing of pyramidal cells across all layers (Fig. 4d; 
firing rate was reduced by 48+ 10%, 84+7%, 55+19% and 
75 + 11% for L2/3, L4, L5 and L6, respectively; P= 0.01). To rule 
out the possibility that this suppression was a result of uniformly 
activating large portions of L6 we restricted the area of activation to 
a small spot of approximately 100 um in diameter while recording 
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Figure 4 | Photostimulation of L6 recruits intracortical synaptic inhibition. 
a, Schematic of in vitro experimental setup. b, Average IPSCs (blue) and EPSCs 
(red) recorded in pyramidal cells during photostimulation of L6. Synaptic 
currents are averages of n = 5-12 cells. Inset, onset of EPSC. c, Histogram of 
excitatory charge as a percentage of total charge. Ex, excitation; Inh, inhibition. 
d, Traces show perforated patch recording from L5 pyramidal cell in response 
to depolarizing current injection with (right) and without (left) L6 
photostimulation. Graphs, spike rate with and without L6 photostimulation. 
e, Average spike rate in control versus spike rate with L6 photostimulation for 
each cell. f, Schematic of experimental setup for focal photostimulation. 

g, Traces, spiking of L5 pyramidal cell to depolarizing current injection with 
focal photostimulation of L6 at three progressively more distant positions (left 
to right). Graph shows spike rate in control (black) and with focal 
photostimulation of L6 (blue) (n = 4). Delta indicates the medial or lateral 
distance from the radial axis through the recording site. h, Percentage of spike 
suppression plotted against horizontal displacement. Error bars, mean + s.e.m. 


from a L5 neuron (Fig. 4f). Even when activating a restricted area of 
L6, the firing of L5 neurons was robustly suppressed (Fig. 4g). The 
suppression was maximal when L6 photostimulation was aligned with 
the recorded L5 neuron along the cortical radial axis, and decreased 
progressively as the photostimulation spot was translated tangentially 
(Fig. 4g, h). These results demonstrate that V1 can efficiently suppress 
activity in L2/3, L4, L5 and Lé6 in the absence of thalamus. 


Major role of L6 intracortical circuits 

Taken together, these results indicate that L6 can modulate cortical 
responses to visual stimuli through two independent circuits: indirectly, 
through the corticothalamic circuit and directly, through the intracor- 
tical circuit. To test whether one of these two circuits has a dominant 
role, we examined how much of the V1 suppression is predicted by 
dLGN suppression. We first established the transfer function between 
dLGN and V1. For this we performed simultaneous in vivo recordings 
from these two structures while presenting full-field drifting gratings of 
varying contrasts to obtain contrast response functions for the (LGN 
and V1 (Fig. 5a, b). By plotting dLGN versus V1 activity at each 
contrast we obtained the transfer function from dLGN to V1, which 
provides the response of V1 to various levels of dLGN activity (Fig. 5c). 
Finally, we presented the strongest contrast and photostimulated L6 to 
reduce dLGN activity while simultaneously monitoring V1 activity. 
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Figure 5 | L6 suppresses upper layers largely 
through intracortical circuits. a, Schematic of 
experimental setup. b, Simultaneously recorded 
multi-unit responses to increasing contrasts (light 
to dark) in V1 (top) and dLGN (bottom). All spikes 
that were recorded above L6 (=650 tm) were 
included in V1 multi-unit activity. Scale bar, 200 
spikes per s for V1; 100 spikes per s for LGN. Black 
bar, visual stimulus (1.5 s). Dotted line, baseline 
activity. Right, contrast-response functions. 
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We reasoned that if the ensuing reduction of V1 activity matches 
the reduction predicted by the transfer function, the modulation of 
cortical responses by L6 is mainly due to dLGN suppression by the 
corticothalamic circuit. However, if the reduction in V1 activity 
exceeds that predicted by the transfer function, the additional reduc- 
tion can be attributed to the intracortical circuit. We reduced d(LGN 
activity by ~10%, 20% and 50% through activation of L6 with three 
progressively stronger illuminations (Fig. 5d). Notably, even the 
smallest reduction in dLGN activity (10%) was accompanied by a 
reduction in V1 activity that largely exceeded that predicted by the 
transfer function (Fig. 5e). Furthermore, a 50% suppression of (LGN 
activity was accompanied by a complete suppression of visually evoked 
activity in V1. In this experiment a large fraction of V1 suppression 
(73% averaged over five LED levels) exceeded the transfer function 
prediction and must therefore be attributed to the intracortical circuit 
(average intracortical component over all experiments 73 + 5%, n = 5; 
Fig. 5f). Furthermore, given the relatively minor effects on the preval- 
ence of burst firing in dLGN neurons (Supplementary Fig. 7f), this 
effect cannot be attributed to a change in the firing pattern of (LGN 
neurons. These results indicate that L6 suppresses cortical responses to 
visual stimuli mainly through intracortical circuits. 


Discussion 

Taken together, this study shows that L6 modulates visually evoked 
activity across L2/3, L4 and L5. This modulation occurs continuously 
through visually driven L6 activity, as shown in awake animals, and 
does not affect orientation tuning indicating that L6 selectively con- 
trols the gain of cortical visual responses. Finally, despite suppression 
of the dLGN, cortical gain control by L6 is executed largely by intra- 
cortical circuits. 

Response gain modulation is a fundamental cortical operation” that 
is crucially involved in sensory representation and sensorimotor integ- 
ration. For example, visual responses in parietal cortex are gain modu- 
lated by gaze direction”’. Furthmore, gain modulation may underlie 
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1 c, dLGN-V1 transfer function derived by plotting 
1 normalized response in V1 versus dLGN (from 
b). Fit, hyperbolic ratio function. d, Simultaneously 
recorded multi-unit responses to maximal 
contrasts in V1 and dLGN without 
photostimulation (black) or while 
photostimulating L6 with increasing LED 
intensities (progressively darker blue). Same 
experiment as in b and c. Black bar, visual stimulus 
(1.5 s); blue bar, LED illumination (1.5 s). Scale bars 
are the same as in b. e, V1 versus LGN response to 
maximal contrast under control condition (black 
data point) or during three progressively stronger 
photostimulations of L6 (light, medium and dark 
blue, data from d). V1 responses are suppressed 
more than predicted by transfer function (red 
arrows) even for photostimulations that reduce 
dLGN activity only ~10% (light blue). f, Average 
intracortical component of suppression as a 
function dLGN suppression (n = 5 experiments). 
Intracortical component (red arrow in e) is 
quantified as a fraction of total V1 suppression 
(grey arrows plus red arrows in e). g, Schematic of 
the main finding. Error bars, mean + s.e.m. 
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the effects of attention on cortical responses to visual stimuli’?”®. 


However, the neuronal circuits that implement this operation have 
remained largely unknown. Identifying L6 as a contributor to cortical 
response gain modulation is an important step in dissecting the specific 
functions of distinct circuits in cortical processing. The suppressive 
action of Lé6 that is described here markedly differs from the facilitatory 
impact of other layers on cortical activity’*'*? (for example, L2/3 
facilitates L5 (ref.*)) and points towards a very distinct function of 
different layers in sensory processing. The cortical GABAergic inter- 
neuron subtype (or subtypes)**** that is recruited by Lé6 activity and 
mediates the reported suppressive effect remains to be identified, but 
may include fast spiking neurons (Supplementary Fig. 4). Although the 
exact synaptic mechanisms underlying gain control by L6 remains to be 
elucidated, either a proportional change in excitation and inhibition***° 
or the modulation of only one of the two opposing conductances*” may 
underlie the operation. The columnar organization of L6 pyramidal cell 
projections to more superficial layers'® ensures that L6-mediated sup- 
pression is restricted to the cortical domains that are directly above the 
activated L6 region (Fig. 4g, h). This topographic organization could 
allow the cortex to differentially modulate the gain of V1 responses to 
stimuli located in distinct regions of visual space. 

L6 has been suggested to contribute to ‘end inhibition’, the suppres- 
sion of cortical responses by bars above a given size’’. The powerful 
inhibitory currents generated by L6 onto more superficial pyramidal 
cells may represent the underlying cellular mechanism. 

Previous studies addressing the role of corticothalamic feedback 
projections through focal pharmacological perturbation of L6 neurons 
have typically reported a facilitation of functionally or topographically 
aligned thalamic neurons overlaid by broader surround suppression"’, 
resulting in changes to both spatial and temporal tuning properties of 
these neurons’***’. Our data obtained using full-field visual stimu- 
lation are consistent with this model, in which spatial summation of 
individual inhibitory surrounds will result in a net suppressive effect of 
the corticothalamic feedback projection. Future studies combining 
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optogenetic approaches with focal stimulation of visual space will 
reveal how fine-scale corticothalamic circuits*”” interact with intra- 
cortical L6 circuits to influence visual processing in the cortex. 

L6 in V1 receives convergent inputs from a variety of brain regions, 
including higher cortical areas*’ as well as thalamus'’. These various 
brain regions could thus influence, through L6, the gain of visual 
responses during the very initial steps of visual cortical processing. 


METHODS SUMMARY 


ChR2, archaerhodopsin and NpHR3.0 were conditionally expressed in mouse V1 
via stereotactic injection of adenoassociated viruses into NISR1-Cre mice’’. For 
recordings under anaesthesia, mice were injected with 5 mg kg” ' chlorprothixene 
and 1.2gkg | urethane. For awake experiments, a craniotomy was performed 
under isoflurane anaesthesia (1-1.5%), and then a previously implanted head- 
plate was used to fix the mouse on a circular treadmill and the anaesthetic was 
removed. In vivo extracellular recordings were made from V1 and dLGN using 
multichannel silicon probes. Visual stimuli were displayed on an LCD screen. 
Microbial opsins were photoactivated using a blue (470-nm) or amber (590-nm) 
LED placed above the thinned skull. In vitro whole-cell recordings were per- 
formed as previously described”. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 


Received 29 August 2011; accepted 4 January 2012. 
Published online 22 February 2012. 


1. Lorente de No, R. in Physiology of the Nervous System (ed. Fulton, J.F.) 274-301 
(Oxford Univ. Press, 1943). 

2. Douglas, R.J.& Martin, K.A. Neuronal circuits of the neocortex. Annu. Rev. Neurosci. 
27, 419-451 (2004). 

3. Lefort, S., Tomm, C., Floyd Sarria, J. C. & Petersen, C. C. The excitatory neuronal 
network of the C2 barrel column in mouse primary somatosensory cortex. Neuron 
61, 301-316 (2009). 

4. Thomson,A.M.& Bannister, A. P. Interlaminar connections in the neocortex. Cereb. 
Cortex 13, 5-14 (2003). 

5. Callaway, E. M. Local circuits in primary visual cortex of the macaque monkey. 
Annu. Rev. Neurosci. 21, 47-74 (1998). 

6. Dantzker, J. L. & Callaway, E. M. Laminar sources of synaptic input to cortical 
inhibitory interneurons and pyramidal neurons. Nature Neurosci. 3, 701-707 
(2000). 

7. Thomson, A. M. Neocortical layer 6, a review. Front. Neuroanat. 4, 13 (2010). 

8. Bourassa, J. & Deschenes, M. Corticothalamic projections from the primary visual 
cortex in rats: a single fiber study using biocytin as an anterograde tracer. 
Neuroscience 66, 253-263 (1995). 

9. Binzegger, T., Douglas, R. J. & Martin, K. A. Stereotypical bouton clustering of 
individual neurons in cat primary visual cortex. J. Neurosci. 27, 12242-12254 
(2007). 

0. Zhang, Z. W. & Deschenes, M. Intracortical axonal projections of lamina VI cells of 
the primary somatosensory cortex in the rat: a single-cell labeling study. 

J. Neurosci. 17, 6365-6379 (1997). 

1. Jones, E. G. The Thalamus (Cambridge Univ. Press, 2007). 

2. Guillery, R. W. & Sherman, S. M. Thalamic relay functions and their role in 
corticocortical communication: generalizations from the visual system. Neuron 
33, 163-175 (2002). 

3. Sillito, A. M. & Jones, H. E. Corticothalamic interactions in the transfer of visual 
information. Phil. Trans. R. Soc. Lond. B 357, 1739-1752 (2002). 

4. Briggs, F. & Usrey, W. M. Emerging views of corticothalamic function. Curr. Opin. 
Neurobiol. 18, 403-407 (2008). 

5. Cudeiro, J. & Sillito, A. M. Looking back: corticothalamic feedback and early visual 
processing. Trends Neurosci. 29, 298-306 (2006). 

6. Sillito, A. M., Cudeiro, J. & Jones, H. E. Always returning: feedback and sensory 
processing in visual cortex and thalamus. Trends Neurosci. 29, 307-316 (2006). 

7. Bolz, J. & Gilbert, C. D. Generation of end-inhibition in the visual cortex via 
interlaminar connections. Nature 320, 362-365 (1986). 

8. Grieve, K. L. & Sillito, A.M. A re-appraisal of the role of layer VI of the visual cortex in 
the generation of cortical end inhibition. Exp. Brain Res. 87, 521-529 (1991). 

9. Gong, S. et al. Targeting Cre recombinase to specific neuron populations with 
bacterial artificial chromosome constructs. J. Neurosci. 27, 9817-9823 (2007). 

20. Nagel, G. et a. Channelrhodopsin-2, a directly light-gated cation-selective 
membrane channel. Proc. Nat! Acad. Sci. USA 100, 13940-13945 (2003). 

21. Boyden, E.S., Zhang, F., Bamberg, E., Nagel, G. & Deisseroth, K. Millisecond- 
timescale, genetically targeted optical control of neural activity. Nature Neurosci. 8, 
1263-1268 (2005). 


6 | NATURE | VOL 000 | 00 MONTH 2012 


22. Niell, C.M. & Stryker, M. P. Highly selective receptive fields in mouse visual cortex. 
J. Neurosci. 28, 7520-7536 (2008). 

23. Hubel, D. H. & Wiesel, T. N. Receptive fields, binocular interaction and functional 
architecture in the cat’s visual cortex. J. Physiol. 160, 106-154 (1962). 

24. Chow, B. Y. et al. High-performance genetically targetable optical neural silencing 
by light-driven proton pumps. Nature 463, 98-102 (2010). 

25. Gradinaru, V. et a/. Molecular and cellular approaches for diversifying and 
extending optogenetics. Ce// 141, 154-165 (2010). 

26. Rafols, J. A. & Valverde, F. The structure of the dorsal lateral geniculate nucleus in 
the mouse. A Golgi and electron microscopic study. J. Comp. Neurol. 150, 303-331 
(1973). 

27. Salinas, E. & Thier, P. Gain modulation: a major computational principle of the 
central nervous system. Neuron 27, 15-21 (2000). 

28. Brotchie, P.R., Andersen, R.A. Snyder, L. H. & Goodman, S. J. Head position signals 
used by parietal neurons to encode locations of visual stimuli. Nature 375, 
232-235 (1995). 

29. Treue, S. & Martinez Trujillo, J. C. Feature-based attention influences motion 
processing gain in macaque visual cortex. Nature 399, 575-579 (1999). 

30. McAdams, C.J. & Maunsell, J. H. Effects of attention on orientation-tuning functions 
of single neurons in macaque cortical area V4. J. Neurosci. 19, 431-441 (1999). 

31. Silver, R.A. Lubke, J., Sakmann, B. & Feldmeyer, D. High-probability uniquantal 
transmission at excitatory synapses in barrel cortex. Science 302, 1981-1984 
(2003). 

32. Adesnik, H. & Scanziani, M. Lateral competition for cortical space by layer-specific 
horizontal circuits. Nature 464, 1155-1160 (2010). 

33. Markram, H. et al. Interneurons of the neocortical inhibitory system. Nature Rev. 
Neurosci. 5, 793-807 (2004). 

34. Ascoli, G.A. et al. Petilla terminology: nomenclature of features of GABAergic 
interneurons of the cerebral cortex. Nature Rev. Neurosci. 9, 557-568 (2008). 

35. Chance, F. S., Abbott, L. F. & Reyes, A. D. Gain modulation from background 
synaptic input. Neuron 35, 773-782 (2002). 

36. Shadlen, M. N. & Newsome, W. T. The variable discharge of cortical neurons: 
implications for connectivity, computation, and information coding. J. Neurosci. 
18, 3870-3896 (1998). 

37. Murphy, B. K. & Miller, K. D. Multiplicative gain changes are induced by excitation 
or inhibition alone. J. Neurosci. 23, 10040-10051 (2003). 

38. Andolina, |. M., Jones, H. E., Wang, W. & Sillito, A.M. Corticothalamic feedback 
enhances stimulus response precision in the visual system. Proc. Natl Acad. Sci. 
USA 104, 1685-1690 (2007). 

39. Wang, W., Jones, H. E., Andolina, |. M., Salt, T. E. & Sillito, A. M. Functional alignment 
of feedback effects from visual cortex to thalamus. Nature Neurosci. 9, 1330-1336 
(2006). 

40. Worgotter, F., Nelle, E., Li, B. & Funke, K. The influence of corticofugal feedback on 
the temporal structure of visual responses of cat thalamic relay cells. J. Physiol. 
509, 797-815 (1998). 

41. McClurkin, J. W. & Marrocco, R. T. Visual cortical input alters spatial tuning in 
monkey lateral geniculate nucleus cells. J. Physiol. 348, 135-152 (1984). 

42. Murphy, P. C., Duckett, S. G. & Sillito, A. M. Feedback connections to the lateral 
geniculate nucleus and cortical response properties. Science 286, 1552-1554 
(1999). 

43. Casagrande, V. A. & Kaas, J. H. The Afferent, lintrinsic and Efferent Connections of 
Primary Visual Cortex in Primates (eds Peters, A. & Rockland, P.) (Plenum, 1994). 


Supplementary Information is linked to the online version of the paper at 
www.nature.com/nature. 


Acknowledgements We are grateful to M. Carandini, J. Isaacson and the members of 
the Scanziani and Isaacson laboratories for helpful discussions of this project, to 

J. lsaacson, R. Malinow and T. Komiyama for providing feedback on the manuscript, to 
P. Abelkop for histological help and neonatal viral injections, to J. Evora for mouse 
colony support and genotyping, to B. Atallah for sharing the technique for silencing the 
cortex by photostimulation of parvalbumin neurons and for help with the in vivo 
recording setup and to W. Bruns for help coding analysis software. We thank the UCSD 
Neuroscience Microscopy Facility (P30 NSO47101) for the use of their imaging 
equipment. S.R.O. and H.A. were supported by postdoctoral fellowships from the Helen 
Hay Whitney Foundation. D.S.B was supported by a UCSD Neurobiology Training Grant 
(NINDS: 5T32NS007220-28). M.S. is an investigator of the Howard Hughes Medical 
nstitute. This work was also supported National Institutes of Health grant RO1 
NS069010 and by the Gatsby Charitable Foundation. 


Author Contributions H.A. performed the initial physiological characterization of the 
NTSR1-Cre expression system with optogenetic tools. H.A. also developed the in vivo 
awake recording preparation on the treadmill. S.R.O. performed all in vivo recordings. 
D.S.B. performed all in vitro recordings and anatomical reconstructions. S.R.O. and M.S. 
designed the study. M.S. wrote the paper. 


Author Information Reprints and permissions information is available at 
www.nature.com/reprints. The authors declare no competing financial interests. 
Readers are welcome to comment on the online version of this article at 
www.nature.com/nature. Correspondence and requests for materials should be 
addressed to M.S. (massimo@biomail.ucsd.edu) or S.R.O. (srolsen@ucsd.edu). 


©2012 Macmillan Publishers Limited. All rights reserved 


METHODS 


All procedures were conducted in accordance with the National Institutes of 
Health guidelines and with the approval of the Committee on Animal Care at 
the University of California, San Diego. 

Animals. We used the following mouse lines: NTSR1-Cre (strain B6.FVB(Cg)- 
Tg(Ntsr1-cre)GN220Gsat/Mmcd, stock number 030648-UCD), which was 
generated by the GENSAT project’? and acquired from the Mutant Mouse 
Regional Resource Centers; tdTomato reporter (Jax number 007908); GAD67- 
GFP (Aneo); and PV-Cre (Silvia Arber). 

Viral injections. Adeno-associated viruses (AAVs) for ChR2 and archaerhodopsin 
were acquired from the University of Pennsylvania Viral Vector Core: AAV2/ 
1.CAGGS.flex.ChR2.tdTomato.SV40 (Addgene 18917) and AAV2/9.flex.CBA. 
Archaerhodopsin-GFP.W.SV40 (Addgene 22222). An AAV virus (AAV2/9) for 
NpHR3.0 was produced at the Salk Viral Vector Core. The NpHR3.0 plasmid 
(pAAV-Efla-DIO-eNpHR 3.0-EYFP) was provided by K. Diesseroth. 

Viruses were loaded in a bevelled sharp micropipette mounted on a Nanoject II 
(Drumond) or a micropump injector (UMP-3 WPI) attached to a micromanipulator. 
ChR2 virus was injected into newborn pups (between postnatal days 0 and 2) that 
were anaesthestized on ice and secured into a moulded platform. Three 20-nl boli 
of virus was injected at each of three medial-lateral locations in V1 and two depths 
(500 jum and 650 pm) within V1. 

Archaerhodopsin was injected in combination with NpHR3.0 in juvenile 
(1-2-month-old) mice anaethestized with 2.5% isoflurane and placed into a 
stereotactic frame (Knopf). The exposed skull overlying V1 was thinned in three 
locations with a dental drill (Foredom) with a 300-m bur (Gesswein), and a hole 
was made with a (25-gauge) needle at each location to permit insertion of the 
injection pipette. A volume of 150 nl of virus was injected at a rate of 20 nl min! 
at each of the three locations and at two depths (900 tm and 700 um). The 
scalp was then sutured and the mouse injected subcutaneously with 0.1 mgkg | 
buprenorphine. In vivo recordings were made 1-2 months after viral injection. 
Slice preparation. Mice were anaesthetized with ketamine and xylazine (100 mg 
kg’ and 10mgkg ', respectively), perfused transcardially with cold sucrose 
solution (in mM: NaCl, 83; KCl, 2.5; MgSOu, 3.3; NaH2POx, 1; NaHCOs, 26.2; 
D-glucose, 22; sucrose, 72; and CaCly, 0.5, bubbled with 95% O2 and 5% CO3) and 
decapitated, and the visual cortex was cut into 300-400-11m coronal sections in 
cold sucrose solution. Thalamic slices were cut 45° off the coronal plane to 
maintain connections between NRT and dLGN. Slices were incubated in sucrose 
solution in a submerged chamber at 34°C for 30 min and then at room temper- 
ature (21 °C) until used for recordings. 

In vitro recordings. Whole-cell recordings were done at 32°C in artificial 
cerebrospinal fluid (in mM: NaCl, 119; KCl, 2.5; NaH2POg, 1.3; NaHCOs, 26; 
D-glucose, 20; MgCl, 1.3; CaCl, 2.5; and mOsm, 305, bubbled with 95% O3 and 
5% CO,). Excitatory and inhibitory synaptic currents were recorded using a 
caesium-based internal solution (in mM: CsMeSQOy,, 115; NaCl, 4; HEPES, 10; 
Na3GTP, 0.3; MgATP, 4; EGTA, 0.3; QX-314-Cl, 2.5; BAPTA(5Cs), 10; 
adjusted to pH7.4 with CsOH; mOsm 295; 3-5 MOhm pipette resistance). 
Voltage-clamp recordings were not considered if the series resistance exceeded 
20 MOhm or varied by more than 10%. Typically, 2-4 neurons were recorded 
from simultaneously. Cell-attached recordings and biocytin fills were carried out 
with a potassium-based internal solution (in mM: K-gluconate, 150; MgCl, 1.5; 
HEPES, 5; EGTA, 1.1; phosphocreatine, 10; adjusted to pH 7.4 with KOH; mOsm 
295). Perforated-patch recordings were carried out using potassium-based 
internal and 10 jg ml” * Gramicidin D (Sigma G5002). Tight seals were held until 
sufficient access allowed injection of current and resolution of action potentials 
(typically 10-20 min). Ruptures of the perforated patch were apparent by a rapid 
drop in series resistance at which point the recordings were discontinued. 
Photostimulation of L6 in vitro consisted of either single 2-ms pulses or a 40-Hz 
train of 2-ms pulses, or of 1-s ramps of light of increasing intensity as previously 
described**. Data were recorded with Multiclamp 700B amplifiers (Axon instru- 
ments) filtered at 2 kHz and digitized with a Digidatal440A (Axon instruments) at 
10kHz. Recordings were analysed using custom-made routines in Igor Pro 
(Wavemetrics). Charges represent the time integral of the synaptic current recorded 
during the first second of photostimulation. The stage was moved using a custom 
made plugin for ImageJ(NIH) to interface with ESP300 (Newport) via SerialPort 
(SerialIO). Drugs used were NBQX (Tocris 1044) and CPP (Ascent Asc-159). 

In vivo recordings in anaesthetized mice. Recordings were performed similarly 
to those previously described”. Animals were anaesthetized with 5mgkg | of 
chlorporthixene (intraperitoneal) and then (5-10 min later) with 1.2g kg? 
urethane (intraperitoneal). During surgery, animals were given 0.5-1.0% isoflurane. 
Animals were placed onto a custom platform and their temperature was maintained 
at 37°C using a feedback-controlled heating pad (FHC). Whiskers and eyelashes 
that were contralateral to the recording side were trimmed and eyes covered with a 
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thin, uniform layer of silcone oil to prevent drying. Protein expression was verified 
by transcranial epifluorescence of the exposed and PBS-moistened skull using a 
Leica MZ10F microscope. Only animals showing expression over the entire extent 
of V1 were used for subsequent experiments. The entire dried skull was covered 
with black dental cement (Ortho-Jet powder (Lang Dental) mixed with black iron 
oxide) but for the previously outlined boundaries of V1 (~1.5-3.5 mm lateral to 
midline and —0.5 to 2.5 mm anterior to lambda suture). A head-plate with a hole of 
~2mm in diameter was mounted over V1 and a small region of skull (~300 x 
750 jum) was thinned using a dental drill. Next, we used sharpened fine forceps 
(Dumont number 55) to make a craniotomy just sufficiently large for inserting the 
probe. A drop of PBS placed in the well at the centre of the head-plate kept the 
exposed skull and craniotomy moist. A multichannel silicon probe mounted on a 
micromanipulator (Luigs-Neumann) was slowly advanced into the brain toa depth 
of 800-1000 jum for linear probes and 200-700 jum for tetrode probes (see later), 
and recordings were started 20 min or more after inserting the probe. 

For dLGN recordings we made a circular craniotomy (~1.5mm in diamater) 
2.6 mm posterior and 2 mm lateral to the bregma suture. Robust visual responses and 
bursting activity that was characteristic of LGN relay neurons were encountered at a 
depth between 2,400 and 3,100 jum“ (Supplementary Fig. 7). For dual recording 
experiments (Fig. 3g, h and Fig. 5), we used a larger head-plate so that a craniotomy 
could be made both over the dLGN and V1. 

Recordings were made with NeuroNexus 16-channel linear (alx16-3mm-50- 
177) or tetrode (a2x2-tet-3mm-150-121) silicon probes. For recordings across 
cortical depth and in dLGN we used the linear configuration. The tetrode con- 
figuration was used to isolate a subset of cells in Fig. 2. Signals were amplified 
X 1000, band-pass filtered between 0.3 Hz and 5 kHz using an AM Systems 3500 
amplifier and acquired at 32 kHz using a NIDAQ board (PCle-6239) controlled 
with custom-written software in Matlab (Mathworks). For dual recording experi- 
ments we used two separate data-acquisition setups (amplifier, NIDAQ board 
and computer). Raw data were stored on a computer hard drive for offline 
analysis. 

At the end of the recording session, animals were killed by administering 4% 

isoflorane and the brain was quickly removed and fixed in 4% paraformaldehyde 
for histological analysis. 
In vivo awake recordings. 1-2 weeks before recording, mice were implanted with 
a head-plate for head fixation. Mice were anaesthetized with 2.5% isoflurane, the 
scalp was removed and a head-plate was fixed over V1 with black dental cement. 
The skull directly overlying V1 was covered with Kwik-Cast (WPI). Animals were 
injected subcutaneously with 0.1 mg kg” ' buprenorphine and allowed to recover 
in their home cage for at least 1 week before recording. 

Several days before recording, mice were familiarized to head fixation within 
the recording setup. They were briefly anaesthetized with isoflurane and the head- 
plate was clamped to a metal post, but otherwise the mice were unrestrained and 
allowed to run in this position on a plastic circular treadmill or track (Fast-Trac 
from Bio-Serv; see Supplementary Fig. 6). The same circular track was present in 
the cages of the mice, where they were familiarized with its use. Mice grew 
accustomed to head fixation over the course of 1-3 15-min sessions and ran 
naturally on the track, occasionally stopping to rest or groom. 

On the day of recording, mice were anaesthetized with 1.5-2% isoflurane, a 
small craniotomy was made over V1, a drop of PBS was placed in the well of a 
head-plate that was clamped to a metal post, and the multichannel probe inserted 
into the craniotomy. After removal of isoflurane the mice regained consciousness 
and typically began running. Recordings did not start before 30 min after the end 
of anaesthesia. Awake recording sessions lasted between 1 and 2 h. Mice typically 
spent ~60-80% of their time running, and the rest of the time was spent resting or 
grooming. Data were not separated according to behaviour. Every 30-60 min 
mice were given a few drops of a 5% glucose solution through a disposable pipette. 
For two mice we performed 2-3 recording sessions, which were made at least a 
day apart. Between sessions the craniotomy was covered with Kwik-Cast. A new 
craniotomy was made for each session. 

Visual stimulation. Visual stimuli were generated in Matlab using the 
Psychophysics Toolbox” and were displayed on a gamma-corrected LCD monitor 
(Dell 52 X 32.5 cm, 60-Hz refresh rate, mean luminance 50 cd m ”) positioned 
25 cm from the contralateral eye. The monitor was positioned for each experiment 
so that the multi-unit receptive field was located approximately in the centre of the 
screen (the multi-unit receptive field was determined by moving a localized drift- 
ing grating patch (~10°) around the screen). During the recording session full- 
field sinusoidal drifting gratings were used. All stimuli had a temporal frequency of 
2 Hz and a spatial frequency of 0.04 cycles per degree. Gratings were randomly 
presented at 8-12 equally spaced directions, except for the experiments in Fig. 5 in 
which we used only two orthogonal grating directions (0° and 90°). The contrast of 
the stimulus was 100%, except for Fig. 5 in which we used six contrast levels (2, 4.4, 
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9.6, 21, 46 and 100%). A grey screen trial was interleaved with the drifting gratings. 
The duration of the visual stimulus was 1.5 s and the inter-trial interval was 3-6 s. 
In vivo photostimulation. To photo-stimulate ChR2 we used a blue (470-nm) 
fibre-coupled LED (1 mm diameter, Doric Lenses) placed ~5-10 mm away from 
the skull. Light from the LED spanned the entire area of V1. An opaque shield of 
black aluminium foil (Thor Labs) prevented LED light from reaching the contra- 
lateral eye. The LED was driven by the analogue output from the NIDAQ board. 
The blue LED was presented at five intensities (approximately 3, 5, 7, 10.5 and 20 
mW measured at the tip of the fibre), but for a minority of experiments we 
presented only the highest LED intensity. Trials were alternated between visual 
stimulus only and visual stimulus plus LED. The strongest LED intensity also 
generated oscillations at gamma frequency, consistent with previous observa- 
tions** (Supplementary Fig. 2). The preferred-orientation of photostimulated 
L6 cells remained unchanged but their tuning curves became broader 
(Supplementary Fig. 2). 

To photostimulate archaerhodopsin and NpHR3.0 we used an amber (590-nm) 
fibre-coupled LED (1 mm in diameter, Doric Lenses) placed ~0.5mm from the 
skull. Because photosuppression of L6 produced a transient decrease in spontaneous 
multi-unit activity in L2-5 at the onset of LED illumination (see Supplementary 
Fig. 6) we turned on the amber LED 1.4s before the visual stimulus began. 
Experiments were performed at the highest LED intensity (~20 mW measured at 
the tip of the fibre). As long as the suppresssion was not complete, the preferred 
orientation of photosuppressed L6 cells remained unchanged (Supplementary Fig. 6). 
In vivo data analysis. All in vivo data analysis was performed with custom 
software written in Matlab. 

Multi-unit spiking activity was defined as all events (spikes) exceeding a 
threshold of 4 s.d. above the noise of the high-pass filtered (500-Hz) signal. 
Spikes were assigned a depth corresponding to the depth of the channel they 
were recorded from. Spikes that were recorded simultaneously on multiple 
channels were considered as a single event and attributed to the channel in which 
they showed the largest amplitude. We determined the depth of each channel by 
considering the depth and the angle of the probe relative to the vertical axis of 
cortex. We assigned spikes to different layers according to the following depths 
(in pm): L2/3, 100-350; L4, 350-450; L5, 450-650; L6, >650. PSTHs were 
composed of 50-ms bins. PSTHs of individual experiments were normalized to 
the first 500 ms of the visual stimulus (for ChR2 experiments) or to the entire 
visual stimulus (for archaerhodopsin and NpHR3.0 experiments) to generate 
average PSTHs. PSTHs for kinetic analysis (Fig. 3h) were composed of 3-ms bins 
and report the normalized difference in firing rates between control (average 
firing over a 50-ms window prior to LED onset) and during LED illumination 
(average firing rate over a 100-ms window, 50 ms after LED onset). For each 
experiment the onset of suppression was determined as the time point at which 
the normalized response fell below 2 s.d. of the baseline. 

The contrast response functions in dLGN and V1 report the normalized, 
baseline-subtracted firing rates and were fitted with a hyperbolic ratio function: 


n 


c 
r=T1max C+ chy 
where r is the response, c is the contrast of the visual stimulus, ryax is a fitted 
constant representing the response saturation level, n is fitting exponent that 
affects the shape of the curve and cso is the semi-saturation constant. The transfer 


function between the dLGN and V1 was fitted with a hyperbolic ratio function: 
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where ry; is the V1 response, ryi,max is a constant representing the V1 saturation 
level, raign is the LGN response, n is a fitting exponent and rg; Gn,50 is the semi- 
saturation constant. The ‘corticothalamic component’ (CT) was defined as the 
fraction of the total V1 suppression accounted for by this predicted response. The 
‘intracortical component’ was then defined as 1—CT component. We performed 
this analysis for five LED levels and averaged across experiments to produce the 
plot in Fig. 5f. 

We isolated single units using spike-sorting software provided by D. N. Hill, 
S. B. Mehta, and D. Kleinfeld”. For both the linear and tetrode probes we analysed 
waveforms extracted from groups of four adjacent electrode sites. We high-pass 
filtered the raw signal at 500 Hz and then detected spiking events exceeding 4-5 
s.d. of the noise. Spike waveforms were clustered using a k-means algorithm. After 
initial automated clustering, we used a graphical user interface to manually merge 


and split clusters. Unit isolation quality was assessed by considering refractory 
period violations and Fisher linear discriminant analysis. In agreement with 
previous studies we could classify waveforms as regular-spiking or fast-spiking 
putative inhibitory neurons. In our data set there was a clear bimodal distribution 
of trough-to-peak times (a threshold of 0.4ms was used to divide fast-spiking 
from regular-spiking units). All units were assigned a depth according to the 
channel that they were detected on, and units were assigned to layers based on 
the depth divisions given above for the multi-unit activity. 

For each unit we computed the visual response as the mean spike-rate occur- 
ring over the time window in which both the LED and visual stimulus were 
present. Thus, for the L6 photostimulation experiments this typically corre- 
sponded to a 500-ms window placed in the centre of the visual response, and 
for the L6 photosuppression experiments this window encompassed the entire 
1.5-s visual stimulus. For all analysis except the orientation tuning analyis in 
Fig. 2, we averaged responses over all stimulus conditions. Following recent 
studies*”* of orientation tuning we computed an OSI as: 


oat Ve rg sin(204.))? + (So 1 cos(20x))? 
wt 
where 1, is the response to the kth direction given by (,. We determined an OSI 
for each unit with and without photostimulation or suppression of L6. We estab- 


lished the preferred orientation and tuning width by first fitting the average 
responses of each unit with a sum of two Gaussians: 
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where ro is a constant offset, r, is the response at the preferred orientation, r, +180 
is a response 180° away from the preferred direction, 0 is the stimulus direction, 
Oy is the preferred orientation and a is the tuning width. The two Gaussians were 
forced to peak 180° apart and to have the same width but could have different 
amplitudes. Control and photostimulation or photosuppression conditions were 
fit separately. To generate the average population tuning curve we first circularly 
shifted the stimulus direction of each unit so that the maximal response occurred 
at 0°. We then normalized the responses to this peak response and averaged all 
normalized tuning curves together. We fit the control population average tuning 
curve with a sum of two Gaussians. The curve for the photostimulation or photo- 
suppression population average was produced by scaling the control curve by the 
slope (gain factor) of the linear fit shown in Fig. 2e, j. 

All error bars are presented as mean + s.e.m. unless otherwise noted. We used 
paired t-tests to assess statistical significance unless otherwise noted. 
Histology. Triple transgenic mice (Ntsr1-Cre, floxed-tdTomato and Gad67-GFP) 
were anaesthetized with ketamine and xylazine (100mgkg ‘ and 10mgkg ', 
respectively) and perfused with cold sucrose (see above) and then perfluoroalkoxy 
(4% in PBS). After 24 h incubation in perfluoroalkoxy, slices were cut into 50-ym 
sections and immunostained as described previously’. Antibodies that were used 
were mouse anti-NeuN (1:400; Millipore MAB377), chicken anti-GFP (1:1000; 
Aves Labs GFP-1020), goat anti-chicken AF488 (1:1,000; Invitrogen A11039) and 
goat anti-mouse AF633 (1:1,000; Invitrogen A21050). Slices were mounted in 
Vectashield with Dapi (Vector Labs, H1500). Images were single confocal sections 
taken on an Olympus FV1000. Layer borders were identified by changes in cell 
density. Cell counts were carried out using standard stereological techniques. 
Biocytin fills and neural reconstructions were done as previously described”. 
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Strict evolutionary conservation followed rapid gene 
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The human X and Y chromosomes evolved from an ordinary pair 
of autosomes during the past 200-300 million years’ *. The human 
MSY (male-specific region of Y chromosome) retains only three 
percent of the ancestral autosomes’ genes owing to genetic decay*”. 
This evolutionary decay was driven by a series of five ‘stratification’ 
events. Each event suppressed X-Y crossing over within a chro- 
mosome segment or ‘stratum’, incorporated that segment into the 
MSY and subjected its genes to the erosive forces that attend the 
absence of crossing over”®. The last of these events occurred 30 
million years ago, 5 million years before the human and Old 
World monkey lineages diverged. Although speculation abounds 
regarding ongoing decay and looming extinction of the human Y 
chromosome” "’, remarkably little is known about how many MSY 
genes were lost in the human lineage in the 25 million years that 
have followed its separation from the Old World monkey lineage. 
To investigate this question, we sequenced the MSY of the rhesus 
macaque, an Old World monkey, and compared it to the human 
MSY. We discovered that during the last 25 million years MSY gene 
loss in the human lineage was limited to the youngest stratum 
(stratum 5), which comprises three percent of the human MSY. 
In the older strata, which collectively comprise the bulk of the 
human MSY, gene loss evidently ceased more than 25 million years 
ago. Likewise, the rhesus MSY has not lost any older genes (from 
strata 1-4) during the past 25 million years, despite its major 
structural differences to the human MSY. The rhesus MSY is 
simpler, with few amplified gene families or palindromes that 
might enable intrachromosomal recombination and repair. We 
present an empirical reconstruction of human MSY evolution in 
which each stratum transitioned from rapid, exponential loss of 
ancestral genes to strict conservation through purifying selection. 

The human Y chromosome no longer engages in crossing over with 
its once-identical partner, the X chromosome, except in its pseudo- 
autosomal regions. During evolution, X-Y crossing over was suppressed 
in five different chromosomal regions at five different times, each 
probably resulting from an inversion in the Y chromosome”’. Each 
of these regions of the Y chromosome then began its own individual 
course of degeneration, experiencing deletions and gene loss. Com- 
parison of the present-day X and Y chromosomes enables identification 
of these five evolutionary ‘strata’ in the MSY (and X chromosome); their 
distinctive degrees of X-Y differentiation indicate their evolutionary 
ages””. The oldest stratum (stratum 1) dates back over 240 million years 
(Myr)’ and is the most highly differentiated, and the youngest stratum 
(stratum 5) originated only 30 Myr ago and displays the highest X-Y 
nucleotide sequence similarity within the MSY’. The five strata and their 
respective decay processes, over tens to hundreds of millions of years of 
mammalian evolution, offer replicate experiments of nature from which 
to reconstruct the trajectories and kinetics of gene loss in the MSY. 


Only the human and chimpanzee MSYs had been sequenced before 
the present study, and they are separated by just 6 Myr of evolution. 
We decided to examine the MSY of a much more distant relative, the 
rhesus macaque (Macaca mulatta), to enable us to reconstruct gene 
loss and conservation in the MSY during the past 25 Myr. We 
sequenced the rhesus MSY using bacterial artificial chromosome 
(BAC) clones and the SHIMS (single-haplotype iterative mapping and 
sequencing) strategy that has previously been used in the human 
and chimpanzee MSYs*"** as well as in the chicken Z chromosome’. 
The resulting sequence is comprised of 11.0 megabases (Mb), is 
complete aside from three small gaps and has an error rate of about 
one nucleotide per Mb. We ordered and oriented the finished sequence 
contigs by fluorescence in situ hybridization and radiation hybrid 
mapping (Supplementary Figs 1-6, Supplementary Table 1, Supplemen- 
tary Files 1, 2 and Supplementary Note 1). 

We then compared the structure of the rhesus Y chromosome to 
that of the human and chimpanzee (Fig. 1). The rhesus Y chromosome 
has virtually no heterochromatin apart from the centromere, and the 
euchromatic segment of the MSY is notably smaller compared to that 
of the human and chimpanzee (Fig. 1). The single pseudoautosomal 
region (PAR) in rhesus corresponds to the short-arm PAR in human 
and to the single PAR in chimpanzee. The precise boundary between 
PAR and MSY is identical in the three species (Supplementary Fig. 7), 
confirming that stratification in all three lineages concluded before the 
divergence of apes from Old World monkeys. 

The euchromatic portions of the rhesus, human and chimpanzee 
MSYs are comprised primarily of two distinct sequence classes: 
X-degenerate and ampliconic. The X-degenerate regions, relics of 
shared X-Y ancestry, are dotted with single-copy homologues of 
X-linked genes. The X-degenerate regions are relatively well conserved 
among the rhesus, human and chimpanzee MSYs, with large blocks of 
homology that are readily identifiable (Supplementary Figs 8 and 9). 
Indeed, the X-degenerate regions are the only portions of the rhesus 
and human MSYs whose sequences can be aligned over distances of 
greater than 50 kb. We found rhesus—human nucleotide divergence 
there to be 9.4% (Supplementary File 3). This is markedly higher than 
the 6.5% divergence that is observed when the rhesus and human 
female genomes are compared”. The difference probably reflects the 
restriction of the MSY to the male germ line, where base-pair sub- 
stitutions are more frequent than in the female germ line’. From these 
data, we calculate the male-to-female mutation rate ratio (%,,) to be 
2.78 (95% confidence interval 2.74-2.81), in agreement with previous 
but less precise estimates'*’®. The X-degenerate sequences in rhesus, 
human and chimpanzee are not entirely colinear, as large-scale re- 
arrangements have occurred in each lineage (Supplementary Figs 8-10). 

For all three species, the MSY’s ampliconic regions are composed of 
long, nearly identical repeat units that are arrayed in either direct or 
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Figure 1 | Comparison of rhesus, human and chimpanzee Y chromosomes. 
a, Schematic representations of rhesus, human and chimpanzee Y 
chromosomes, to scale. Other, single-copy, male-specific sequences that are 


inverted orientation and undergo frequent gene conversion—a process 
that is thought to slow or prevent the decay of genes that reside 
there*'”. Ampliconic genes display testis-specific expression patterns, 
consistent with their having critical roles in spermatogenesis*'?”*. 
Only 0.5 Mb of the rhesus MSY euchromatin is ampliconic, compared 
to 10.2 Mb and 14.7 Mb in human and chimpanzee, respectively 
(Fig. 1, and Supplementary Figs 11 and 12). In human and chimpanzee, 
the ampliconic regions of the MSY feature large palindromes, each 
composed of two inverted repeats (arms) separated by a short spacer. 
The human and chimpanzee MSYs have 8 and 19 palindromes that 
span 5.5 Mb and 7.5 Mb, respectively*"*. By contrast, the rhesus MSY 
has only three palindromes and these collectively span 437 kb (Sup- 
plementary Table 2 and Supplementary Fig. 13). Two of the rhesus 
MSY palindromes are orthologues of human MSY palindromes, 
demonstrating that these structures have been maintained for at least 
25 Myr (Supplementary Fig. 13). 

We identified protein-coding genes in the rhesus MSY using three 
complementary approaches. First, we electronically searched the 
rhesus MSY for homologues of all known human and chimpanzee 
MSY genes and pseudogenes. Second, we searched for homologues 
of all known human X-linked genes, to identify any X-Y shared genes 
that had been lost in both the human and chimpanzee MSY but 
retained in the rhesus MSY. Third, we searched for additional 
rhesus-specific MSY genes using electronic prediction tools and 
high-throughput sequencing of rhesus testis complementary DNA 
(245 Mb in total). We validated each putative gene by verifying tran- 
scriptional activity (Supplementary Fig. 14) and, where applicable, by 
comparing its predicted open reading frame to that of its human 
orthologue (Supplementary Table 3). 

We then compared the catalogues of MSY genes in rhesus, human* 
and chimpanzee”* to infer gene loss and conservation during the past 
25 Myr. To root this analysis in a deep evolutionary context, we first 
reconstructed which of the modern rhesus MSY genes were present on 
the common autosomal ancestor of X and Y (Fig. 2, Supplementary 
Table 4 and Supplementary Note 2). Most ‘ancestral’ MSY genes would 
be expected to have a homologue both on the human X chromosome and 
on the chicken autosomes (chromosomes 1 and 4) that share common 
ancestry with mammalian X and Y chromosomes’». Indeed, 33 genes 
and pseudogenes in the rhesus, human or chimpanzee MSY have their 
most closely related human homologues on the X chromosome 
(Fig. 2), and 29 of these also have homologues within syntenic regions 
of chicken chromosome 1 or 4. Analyses of a more distant outgroup, 
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neither X-degenerate nor X-transposed. b, Sizes (in Mb) of euchromatic 
sequence classes in MSYs. cen, centromere. 


Xenopus tropicalis, revealed that two of the four rhesus MSY genes 
lacking homologues on chicken chromosome 1 and 4 (TSPY and 
AMELY) are X-Y ancestral; they were lost in the chicken lineage after 
divergence from mammals (Supplementary Note 2). A few human 
MSY genes with X homologues are recent additions to the MSY rather 
than remnants of the ancestral autosome pair; PCDH11Y and 
TGIF2LY are located in the human-specific X-transposed region’, 
and the X-linked homologue of VCY is found only in simian primates”. 
We found a total of 30 ancestral MSY genes and pseudogenes in rhesus, 
human or chimpanzee (Fig. 2). 

Within strata 1-4, which collectively comprise the bulk of the 
human MSY, the rhesus and human MSYs possess precisely the same 
18 ancestral genes (Fig. 2). This notable and unanticipated identity 
leads us to conclude that, 25 Myr ago, in the last common ancestor of 
rhesus and human, MSY strata 1-4 also carried these 18 ancestral 
genes (Table 1 and Supplementary Table 5), and that no loss of ancestral 
genes occurred subsequently in either lineage (Supplementary Note 3). 
We note that, within strata 3 and 4, the rhesus and human MSYs carry a 
total of six ancestral pseudogenes that seem to have lost their function 
more than 25 Myr ago (Supplementary Fig. 15). 

The evolutionary stability of ancestral genes in strata 1-4 could be 
explained by purifying selection, which, in the absence of sexual 
recombination, would have preserved critical ancestral genes for tens 
or even hundreds of millions of years. We demonstrated previously 
that purifying selection preserved MSY genes during the past 100,000 
years of human population expansion and migration”’. Comparing 
human and rhesus, we find that most ancestral genes display a ratio 
of nonsynonymous substitution rate to synonymous substitution rate 
that is significantly less than one (Supplementary Note 4, Supplemen- 
tary Table 3 and Supplementary Fig. 16), demonstrating purifying 
selection during the past 25 Myr. 

The pattern of gene loss and conservation in stratum 5, formed only 
5 Myr before the rhesus and human lineages split, is remarkably dif- 
ferent from the pattern in the four older strata. Within the past 30 Myr, 
four ancestral genes have been inactivated or deleted from stratum 5 of 
the MSY in both rhesus and human (Fig. 2 and Supplementary Note 5). 
A fifth ancestral gene, MXRASY, remains active in rhesus (Supplemen- 
tary Fig. 11) but was inactivated by an intragenic deletion in the human 
lineage (Supplementary Fig. 17). Apart from MXRASY, all differences 
in MSY gene content between rhesus and human involve genes that 
were added to the human MSY subsequent to the ape-Old World 
monkey split (Fig. 2 and Supplementary Table 5). 
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Figure 2 | Inventories of genes, both ancestral and added, in rhesus, human 
and chimpanzee MSYs. Ancestral genes grouped by stratum (1-5). In rhesus, 
human and chimpanzee, current status of each MSY gene is indicated by 
shading in one of three columns: present and intact, inactivated pseudogene, or 
absent or deleted. Total numbers of intact genes, pseudogenes (pseudo), and 
absent genes—both ancestral and added—are tallied for each species. For each 
MSY gene, whether the most closely related human homologue is located on the 
X chromosome is shown (right). 


Table 1 | Stratification of X-Y ancestral gene loss in primate MSYs 
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Figure 3 | Kinetics of ancestral gene loss during evolution of five human 
MSY strata. Gene numbers are plotted on a log scale on the y axis, and time (in 
Myr before present) is plotted on the x axis. Filled circles show inferred or 
observed gene numbers in (from left to right) X-Y ancestral chromosome (at 
time of stratum formation), rhesus—chimpanzee-human ancestral MSY (25 
Myr ago), chimpanzee-human ancestral MSY (6 Myr ago), and modern 
human MSY. Dotted and dashed lines represent best-fit curves to data points 
using each of three decay models as indicated. 


Returning to strata 1-4, we note that five ancestral genes have been 
inactivated or lost from the chimpanzee MSY during the past 
6 Myr’*”’, in sharp contrast to the strict conservation of ancestral gene 
content in rhesus and human (Fig. 2). We previously proposed that in 
the chimpanzee lineage promiscuous mating behaviour’', sperm com- 
petition and intense sexual selection that focused on the MSY drove 
rapid evolution and amplification of MSY sequences that are asso- 
ciated with spermatogenesis'*'*. Furthermore, we speculated that in 
the chimpanzee lineage inactivated alleles of some ancestral genes 
became fixed in the population through ‘genetic hitchhiking’; casualties 
of positive but indiscriminate selection operating in the absence of 
sexual recombination in the MSY'*’*”?. Among primate species, 
chimpanzees have a high testis-weight to body-weight ratio, a useful 
indicator of the degree of sperm competition”. Although the rhesus 


Age of stratum 
(millions of years) (from refs 2, 3) 


Number of ancestral genes 
on human X chromosome* 


Number of ancestral genes on MSY 


Last common ancestor+ Rhesus Human Chimpanzee 
Stratum 1 240-320 414 5 5 5 4 
Stratum 2 130-170 88 2 2 2 2 
Stratum 3 80-130 143 8 8 8 5 
Stratum 4 38-44 i? 3 3 3 2 
Stratum 5 29-32 7 2-7 2 1 it 


*Gene numbers from ref. 5, Supplementary Table 4 and Supplementary Note 2. 


+ Gene counts in MSY of a hypothetical rhesus-human-chimpanzee ancestor deduced from observed gene counts in extant species. 
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is similarly promiscuous and has an even higher testis-weight to body- 
weight ratio, the rhesus MSY shows little evidence of intense sexual 
selection. We suggest that in the rhesus lineage, such selection was 
focused on spermatogenesis factors that are encoded elsewhere in 
the genome. This would also account for the virtual absence in rhesus 
of the MSY sequence amplification that is prominent in human and 
even more pronounced in chimpanzee (Fig. 1). 

Our knowledge of all five strata of the MSY, gained through our 
comprehensive comparisons of ancestral gene content in the rhesus, 
human and chimpanzee MSYs, enabled us to reconstruct the kinetics 
and trajectory of human MSY evolution. For each of the five MSY 
strata, we estimated ancestral gene numbers at three points in the 
human evolutionary lineage: in the last common ancestor of human 
and chimpanzee (6 Myr ago), in the last common ancestor of human 
and rhesus macaque (25 Myr ago) and at the time of the stratum’s 
formation, when X-Y differentiation was initiated (from ~30 to >240 
Myr ago; Table 1). For each stratum, we plotted these three estimated 
numbers against evolutionary time, together with the observed number 
of ancestral genes in modern human, and fit a curve (Fig. 3 and Sup- 
plementary Fig. 18). For each of the five strata, a simple two-parameter 
model, using an exponential decay equation that includes a baseline 
constant, provides an excellent fit to our data (Fig. 3 and Supplementary 
Table 6). According to this reconstruction, ancestral gene decay within 
each stratum proceeded rapidly at first—with an ancestral gene half-life 
of less than 5 Myr (Supplementary Table 6)—but then decelerated 
markedly, as the ancestral gene count reached a stable level far below 
its starting point. In our reconstruction, strata 1-4 had already reached a 
stable level before the human lineage diverged from rhesus; after diver- 
gence from rhesus, gene loss in the human lineage was limited to 
stratum 5, the youngest stratum, which stabilized before the human 
lineage diverged from chimpanzee. 

Our empirical reconstruction of MSY evolution is at odds with a linear 
model’*”® and with a simple random decay (exponential) model’*, both 
of which project a continual decline of MSY gene content and cannot 
account for the stability of gene content that we observe over the past 25 
Myr (Fig. 3). Our data are better explained by more complex models for 
MSY gene loss that incorporate a combination of evolutionary forces”. 
Sequencing additional Y chromosomes from animals that represent 
more divergent mammalian lineages will enable refinement of our 
reconstruction of MSY gene kinetics in the human lineage. 


METHODS SUMMARY 

BAC selection and sequencing. The SHIMS (single-haplotype iterative mapping 
and sequencing) strategy’’ was used to assemble a path of sequenced clones selected 
from the CHORI-250 BAC library (http://bacpac.chori.org) and a custom BAC 
library (RMAEX) constructed by Amplicon Express (http://www.genomex.com). 
Fluorescence in situ hybridization analysis. Assays were performed on rhesus 
fibroblast cell line PROO112 from Coriell Institute for Medical Research (http:// 
ccr.coriell.org). Extended metaphase fluorescence in situ hybridization (FISH) and 
interphase FISH were performed as previously described’. 

Radiation hybrid mapping. Nine sequence-tagged site (STS) markers 
(Supplementary Table 7) were tested on a 10,000-rad panel consisting of 185 
hybrid clones’*. A genetic map was constructed and analysed statistically using 
RHMAPPER 1.22 (ref. 29). 

Generation of complementary DNA for polymerase chain reaction with 
reverse transcription (RT-PCR) and 454 sequencing. cDNA was generated from 
total RNA that was isolated from male rhesus tissues using the RNeasy kit (Qiagen). 
For 454 sequencing, cDNA was normalized using the Trimmer kit (Evrogen). 
Alignments and dot plots. Rhesus and human Y sequences were aligned using 
Stretcher (http://bioweb2.pasteur.fr/docs/EMBOSS/stretcher.html) with a gap 
open penalty of 20 and a gap extend penalty of 1. Dot-plot analyses were performed 
using custom Perl codes (http://jura.wi.mit.edu/page/papers/Hughes_et_al_2005/ 
tables/dot_plot.pl). 

Calculation of @,. The male-to-female mutation rate ratio was calculated from 
the human-rhesus Y divergence rate and the human-rhesus autosomal diver- 
gence rate using a previously described method’>”’. 

Modelling ancestral MSY gene loss. We fit a one-phase exponential decay model 
with a baseline constant (shown below) to our data (gene numbers shown in 
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Table 1) using nonlinear regression analysis in GraphPad Prism 5.0. Parameters 
for each stratum are given in Supplementary Table 6. 
One-phase exponential decay model: 


N(t)=(Ny—b)e “+b 


Where Nt) is the number of genes at time t, No is the number of genes within given 
stratum in ancestral autosomal or pseudoautosomal portion of genome, K is the 
decay constant and b is the baseline (approximated by the number of active 
ancestral genes within that stratum on human Y chromosome). 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 


BAC selection and sequencing. The SHIMS (single-haplotype iterative mapping 
and sequencing) strategy'’ was used to assemble a path of sequenced clones 
selected from the CHORI-250 BAC library (http://bacpac.chori.org) and a custom 
BAC library (RMAEX) constructed by Amplicon Express (http://www.genomex. 
com). The rate of error in the finished sequence was estimated by counting mis- 
matches in overlapping clones. 

FISH analysis. All assays were performed on rhesus fibroblast cell line PROO112 
obtained from the Coriell Institute for Medical Research (http://ccr.coriell.org). 
Extended metaphase FISH and interphase FISH were performed as previously 
described”’. 

Radiation hybrid mapping. Nine STS markers (Supplementary Table 7) were 
tested on a 10,000-rad, male whole-genome panel consisting of 185 hybrid 
clones”. The average retention frequency of the markers tested was 16%, ranging 
from 10-27%. A genetic map was constructed and analysed statistically using 
RHMAPPER 1.22 (ref. 29). 

RT-PCR. Total RNA was isolated from male rhesus tissues (brain, prostate, liver, 
lung and spleen testis; Alpha Genesis) using the RNeasy kit (Qiagen) and cDNA 
was generated. RT-PCR primer sequences and product sizes are listed in Sup- 
plementary Table 8. 

454 sequencing of testis cDNA. Rhesus testis cDNA was generated from total 
RNA isolated using the RNeasy kit (Qiagen). The cDNA was normalized using the 
Trimmer kit (Evrogen) and sequenced on a 454 FLX Titanium machine. 
Alignments and dot plots. Rhesus and human Y sequences were aligned using 
Stretcher (http://bioweb2.pasteur.fr/docs/EMBOSS/stretcher.html) with a gap open 


6 | NATURE | VOL 000 | 00 MONTH 2012 


penalty of 20 and a gap extend penalty of 1. Dot plot analyses were performed using 
custom Perl codes (http://jura.wi.mit.edu/page/papers/Hughes_et_al_2005/tables/ 
dot_plot.pl). 

Calculation of @,. The male-to-female mutation rate ratio was calculated 
using the human-rhesus Y divergence rate (9.40%, 312,840 substitutions per 
3,330,847 sites examined) and the human-rhesus autosomal divergence rate 
(1.385 X 10° substitutions per 2.248 X 10° sites examined; hg18-rheMac2 
alignments downloaded from http://www.genome.ucsc.edu). Miyata’s formula 
was then used to calculate «,, (refs 15, 30):Y/A = 20,,/(1 + %). Confidence 
intervals for ratios of divergence rates were calculated as previously described”°. 
Modelling ancestral MSY gene loss. We modelled the numbers of ancestral genes 
within individual MSY strata as a function of time in millions of years before the 
present by fitting a one-phase exponential decay model with a baseline constant 
(below) to our data (gene numbers shown in Table 1) using nonlinear regression 
analysis in GraphPad Prism 5.0. Parameters for each stratum are given in Sup- 
plementary Table 6. 

One-phase exponential decay model: 


N(t)=(No—b)e"*! +b 


Where N(t) is the number of genes at time f, No is the number of genes within a 
given stratum in the ancestral autosomal or pseudoautosomal portion of genome, 
Kis the decay constant and b is the baseline (approximated by the number of active 
ancestral genes within that stratum on human Y chromosome). 
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Repetitive motor learning induces coordinated 
formation of clustered dendritic spines in vivo 


Min Fu!, Xinzhu Yu!, Ju Lu? & Yi Zuo! 


Many lines of evidence suggest that memory in the mammalian 
brain is stored with distinct spatiotemporal patterns’*. Despite 
recent progresses in identifying neuronal populations involved 
in memory coding*°, the synapse-level mechanism is still poorly 
understood. Computational models and electrophysiological data 
have shown that functional clustering of synapses along dendritic 
branches leads to nonlinear summation of synaptic inputs and 
greatly expands the computing power of a neural network®”*. 
However, whether neighbouring synapses are involved in encoding 
similar memory and how task-specific cortical networks develop 
during learning remain elusive. Using transcranial two-photon 
microscopy", we followed apical dendrites of layer 5 pyramidal 
neurons in the motor cortex while mice practised novel forelimb 
skills. Here we show that a third of new dendritic spines (post- 
synaptic structures of most excitatory synapses) formed during 
the acquisition phase of learning emerge in clusters, and that most 
such clusters are neighbouring spine pairs. These clustered new 
spines are more likely to persist throughout prolonged learning 
sessions, and even long after training stops, than non-clustered 
counterparts. Moreover, formation of new spine clusters requires 
repetition of the same motor task, and the emergence of succedent 
new spine(s) accompanies the strengthening of the first new spine 
in the cluster. We also show that under control conditions new 
spines appear to avoid existing stable spines, rather than being 
uniformly added along dendrites. However, succedent new spines 
in clusters overcome such a spatial constraint and form in close 
vicinity to neighbouring stable spines. Our findings suggest that 
clustering of new synapses along dendrites is induced by repetitive 
activation of the cortical circuitry during learning, providing a 
structural basis for spatial coding of motor memory in the 
mamunalian brain. 

Spines are dendritic protrusions that contain all the essential com- 
ponents for postsynaptic signalling and are thus a good indicator of 
synaptic connectivity'*'*. The clustered plasticity model suggests that 
neighbouring spines tend to transmit similar information to the 
postsynaptic neuron®’. To investigate the formation and functional 
significance of spine clusters during learning, we trained thy1-YFP-H 
mice" with a seed-reaching task’ and followed the dynamics of spines 
on apical dendrites of layer 5 (L5) pyramidal neurons in the motor 
cortex contralateral to the trained limb during different learning 
phases. We found that 32.5 + 2.2% of new spines that formed during 
the acquisition phase of learning (early training, days 1-4) emerged in 
clusters; that is, two or more neighbouring new spines without inter- 
spersed existing spine(s) (Fig. 1a, b). Most such clusters (61 cases) 
comprised two contiguous new spines, and the other two clusters 
comprised three. In contrast, fewer new spine clusters emerged in 
untrained control mice over the same period (6.8 + 4.6%, P< 0.01) 
or in trained mice during the consolidation phase of learning (late 
training, days 13-16; 7.4+ 4.3%, P<0.01; Fig. 1b). In addition to 
clustering of contiguous new spines, we observed a few cases in which 
two or more new spines formed in close vicinity to each other, but with 


up to three existing spines interspersed among them, as well as cases in 
which new filopodia clustered with new spines (Supplementary Fig. 1). 
We incorporated these cases in another set of analyses, in which a 
cluster was defined as a set of new spines/filopodia formed within 
5 um of each other, regardless of the presence or absence of existing 
spine(s) between them (Supplementary Information). These analyses 
again revealed that a significantly higher percentage of new spines 
clustered during early training, compared with that in controls or 
during late training (P< 0.01 for both cases; Supplementary Fig. 2). 
They also showed that filopodia only made a minor contribution to 
new protrusion clusters (Supplementary Fig. 3). More interestingly, 
among the new spines observed at the end of the acquisition phase (day 
4), clustered new spines had a significantly higher survival rate than 
non-clustered ones (that is, individual new spines flanked by two 
existing spines) by training day 16 (P < 0.01), as well as 4 months after 
training stopped (P < 0.05; Fig. 1c). Together, our results reveal that 
motor learning induces coordinated formation of clustered synapses, 
which presumably belong to the same neuronal circuit and persist over 
time to encode motor information. 

Perfection of a motor skill requires repeated practice, usually 
through multiple training sessions. We therefore sought to find out 
whether clustered new spines observed on training day 4 were formed 
within the same training session or across different sessions. We 
imaged the mice three times (on the day before training, and after 1 
and 4 days of training), and found that among new spine clusters 
observed on training day 4, only 2.4% were composed of spines that 
formed together between training days 0 and 1. On the other hand, 
43.9% of clusters were composed of spines formed between days 1 and 
4, and the remaining 53.7% of clusters consisted of one spine formed 
between days 0 and 1 (the first new spine) and another spine formed 
between days 1 and 4 (the second new spine). Thus most new spine 
clusters emerged through recurrent training sessions. To determine 
how the formation of the second new spine in a cluster correlates with 
functional changes of the first new spine, we categorized first new 
spines into three groups based on their survival and neighbouring 
spine addition: transient new spines (formed on training day 1 but 
lost by day 4); persistent clustered new spines (formed on training 
day 1, survived until day 4, with an adjacent new spine formed between 
days 1 and 4); and persistent non-clustered new spines (formed on 
training day 1, survived until day 4, with no adjacent new spine forma- 
tion) (Fig. 1d). As spine head size closely correlates with synaptic 
strength, we followed head sizes of first new spines over time. On 
training day 1, we found that head sizes of both transient and persistent 
new spines were significantly smaller than those of existing stable 
spines along the same dendrite (P < 0.001 for both cases, Supplemen- 
tary Fig. 4). By training day 4, head sizes of persistent clustered new 
spines increased significantly (P< 0.01; Fig. le and Supplementary 
Fig. 5a), whereas head sizes of persistent non-clustered new spines 
remained comparable to day 1 (P>0.2; Fig. 1f and Supplemen- 
tary Fig. 5b). Because spine head size is a good proxy for synaptic 
strength, these data suggest that formation of the second new spines 
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accompanies synaptic potentiation at first new spines during motor 
learning. It is consistent with previous reports that long-term poten- 
tiation at a single spine can facilitate formation and potentiation of 
neighbouring spines’*””. 

Different sets of synapses have been shown to be involved in differ- 
ent motor tasks'*. We therefore trained the same mice sequentially 
with two motor skills (cross-training) to determine if spines induced 
by different motor tasks cluster. Cross-training started with the reach- 
ing task on day 1 and then switched to the capellini-handling task, 
which also requires forelimb coordination, on days 2-4 (Fig. 2a and 
Supplementary Table 1). We found that 12.3 40.4% new spines 
formed during the capellini-handling task between days 1 and 4, 
among which 28.4 + 2.8% occurred in clusters (Fig. 2b-d). Both the 
spine formation rate and the percentage of clustered new spines were 
comparable to those in mice continuously trained with the reaching 
task (reach-only) (P > 0.5 in both cases), and were significantly higher 
than those in control mice over the same period of time (P< 0.01 in 
both cases, Fig. 2c, d). Thus, the capellini-handling task itself 
can induce clustered spine formation. However, only 3.3 + 2.1% of 
capellini-handling-induced new spines clustered with reaching- 
induced new spines in cross-training. This contrasts with the outcome 
of reach-only training (13.8 + 1.0%, P< 0.01, Fig. 2e), suggesting that 
new spines induced by different tasks have a low incidence of cluster- 
ing with each other. To characterize further the task-specific nature of 
clustered spine formation, we housed animals in a motor enriched 
environment with daily change of motor challenges (Fig. 2a; see 
Methods). Motor enrichment also robustly enhanced spinogenesis: 
13.7 + 0.8% new spines formed between days 1 and 4, comparable 
to the percentages under reach-only and cross-training conditions 
(P>0.1 for both cases). However, only 12.6+1.1% of these new 
spines appeared in clusters, a percentage comparable to controls 
(P > 0.2, Fig. 2d) but significantly lower than that under reach-only 
or cross-training conditions (P< 0.01 for both cases). Together these 
data indicate that, whereas novelty in learning stimulates spinogenesis, 
repetitive activation of the same cortical circuit is crucial in clustered 
spine formation (Supplementary Fig. 6). 

The phenomenon of learning-induced, coordinated spinogenesis led 
us to investigate further the spatial distribution of new spines. We first 
examined the distance between a new spine (n) and its nearest existing 
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Figure 1 | Acquisition of a novel motor skill 
induces formation of spine clusters. a, Repeated 
imaging of the same dendritic branch during motor 
learning reveals that a second new spine that 
formed between days 1 and 4 (red arrowhead) is 
located next to a stabilized new spine that had 
formed on day 1 (blue arrowhead). Scale bar, 1 jim. 
b, A higher percentage of new spines formed in 
clusters over 4 days during early training (n = 18 
mice), compared with control (n = 7) and late 
training (n = 4). c, Clustered new spines observed 
on training day 4 have a higher survival rate than 
non-clustered counterparts by the end of the 16- 
day training (n = 6), as well as 4 months after 
training stops (n = 4). d, New spines formed on 
training day 1 are classified according to their fate 
and neighbouring spine formation. e, Spine head 
sizes of persistent clustered new spines increase 
between training days 1 and 4. f, Spine head sizes of 
persistent non-clustered new spines show no 
change between training days 1 and 4. Spine head 
size is quantified by the normalized spine head 
diameter, defined as the ratio of the spine head 
diameter to the adjacent dendritic shaft diameter. 
*P< 0.05, **P< 0.01. Error bars, s.e.m. 


oClustered 
aNon-clustered 


Day 0-4-16 Day 0-4-adult 


Non-clustered 
new spines 


1.7 


Day 1 


Day 4 


spine (s) (D,_;, Fig. 3a) in control mice. We then simulated D,_, dis- 
tribution under the null hypothesis that new spines form uniformly and 
independently along the dendrite (see Methods). Compared with simu- 
lation results, the median of measured values of D,_, was significantly 
larger (Fig. 3b), and the cumulative probability distribution of measured 
values of D,, was shifted towards longer distances (Fig. 3c). These 
results suggest that new spines are not randomly dispersed along 
dendritic segments, and their apparent avoidance of existing stable 
spines under control conditions is consistent with the idea that 
neighbouring spines share and compete for local resources'*”!. 

To determine if motor learning alters the spatial distribution of new 
spines, we examined values of D,_, in mice trained with the reaching 
task. We found that the distance between a new spine formed on training 
day 1 (n,) and the nearest existing spine (D,)_,) was comparable for 
trained and control mice (P > 0.7). We classified new spines formed 
between training days 1 and 4 (n) into two categories: clustered n, (that 
is, those that formed next to a stabilized first new spine; Fig. 3d) and non- 
clustered nz (those that did not form next to a stabilized first new spine). 
We found that clustered n, were significantly closer to their nearest stable 
spines (n, or a stable spine existing since day 0; Fig. 3d) (Dy2-s, clustered) 
than were n,; (D,;_;) (P< 0.05). In contrast, the distance between a 
non-clustered n, and its nearest stable spine (Dy2-s, non-clustered) WaS 
comparable to D,;_, (P > 0.9, Fig. 3e). In addition, when an n, formed 
between two adjacent stable spines, the distance between the two stable 
spines (D,_»1-s) was comparable for control and trained mice (P > 0.7; 
Supplementary Fig. 7). However, the distance between a stabilized n, 
and the adjacent stable spine, between which a clustered n, formed 
(Dyi-n2-s)» Was significantly smaller than the distance between two 
adjacent stable spines, between which a non-clustered n, formed 
(Ds-n2-s3 P< 0.01; Supplementary Fig. 7). These results suggest that 
learning-induced clustered new spines can overcome the spatial con- 
straint of existing spines and be packed into tighter dendritic space. 

Recent studies have shown that dendritic spines are dynamic in the 
living brain, and that rearrangement of cortical connections through 
de novo growth and loss of spines provides a structural substrate for 
experience-dependent plasticity** *°. Built upon these works, our study 
reveals a novel spatial rule of spinogenesis during motor learning. We 
found that learning-induced new spines tend to form in small clusters 
(mostly pairs). The correlation between the emergence of the second 
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Figure 2 | Clustered new spines form over multiple training sessions of the 
same, but not different, motor tasks. a, Timelines of reach-only, cross- 
training and motor enrichment experiments. b, Repeated imaging of the same 
dendritic branch revealed that two neighbouring new spines (arrowheads) 
formed between days 1 and 4 during cross-training. Scale bar, 1 jum. c, Higher 
percentages of new spines formed between days 1 and 4 in reach-only, cross- 
training and motor enrichment, compared with controls. d, Higher percentages 
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Figure 3 | The spatial distribution of new spines along dendrites. 

a, Schematic illustrating the measurement of D,_,. b, The median of measured 
values of D,_, (red circle) is significantly larger than that of simulated values of 
Dy-; (box plot of results from 1,000 simulations, with whiskers representing the 
minimum and the maximum) in control mice. The simulation is based on the 
null hypothesis that new spines are added independently and uniformly along a 
linear dendrite. c, Cumulative probability distribution of measured D,,, is 
shifted towards longer distances than the simulated D,_, in control mice. 


of new spines formed in clusters between days 1 and 4 in reach-only and cross- 
training, compared with controls. e, A higher percentage of new spines that 
formed between days 1 and 4 clustered with new spines that had formed 
between days 0 and 1 in the reach-only condition, compared with controls. 
Number of mice examined: six control, nine reach-only, five cross-training and 
six motor enrichment. **P < 0.01, ***P < 0.001. Error bars, s.e.m. 
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d, Schematic illustrating the measurement of Dy,2_s, clustered: The nearest spine to 
a clustered n, could be either a persistent first new spine (n;) or a stable spine 
existing since day 0, depending on relative n, location. e, D,)_, in control mice is 
comparable to that of trained mice. In trained mice, Dy2-s, clustered iS 
significantly smaller than D,;_,, whereas Dy2-s, non-clustered is comparable to 
D,\-s. The number of spines analysed in each condition is indicated on each 
column. *P < 0.05. Error bars, s.e.m. 
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new spine and the strengthening of the first new spine also suggests their 
potential participation in the same neuronal circuit. These findings 
support the clustered plasticity model, which postulates that synapses 
located close together along the same dendritic branch are more likely 
to be allocated for the same information than synapses dispersed 
throughout the dendritic arbor’. Indeed, in the mouse auditory cortex, 
although spines tuned for different frequencies are highly interspersed, 
26% of neighbouring spines exhibit similar effective frequencies, much 
more frequently than anticipated from random distribution (10%)”*. 
Therefore, although neurons tend to maximize their overall connec- 
tions”, clustered plasticity ensures strengthening of circuit-specific 
connections and enables spatial coding for task-related information. 

Previous electron microscopy studies have revealed that neighbouring 
spines can form synapses with the same axon***° (see Supplemen- 
tary Fig. 8a, c, e). Positioning multiple synapses between a pair of 
neurons in close proximity allows nonlinear summation of synaptic 
strength, and potentially increases the dynamic range of synaptic 
transmission well beyond what can be achieved by random positioning 
of the same number of synapses. Alternatively, clustered new spines 
may synapse with distinct (but presumably functionally related) 
presynaptic partners (Supplementary Fig. 8b, d). In this case, they 
could potentially integrate inputs from different neurons nonlinearly 
and increase the circuit’s computational power. Distinguishing 
between these two possibilities would probably require circuit recon- 
struction by electron microscopy following in vivo imaging to reveal 
the identities of presynaptic partners of newly formed spines. 

Profiling spine formation during novel experiences, our data revealed 
a critical role of repetitive activation of the same neuronal circuit. The 
fact that the second new spine in a cluster can overcome the spatial 
constraint imposed by existing spines suggests that repetitive activation 
of a neuronal circuit may modify or reallocate local ‘resources’ for 
spinogenesis. Such resources may be permissive or instructive 
molecular cues at the pre- or postsynaptic site, or the availability of 
suitable partners (for example, axonal boutons). Understanding 
the nature and regulation of such resources may hold the key to 
elucidating the cellular mechanisms of clustered spine formation. It 
will be conducive to the development of tools to label and manipulate 
specific synaptic populations, and ultimately to the dissection of 
the causal relationship between synaptic dynamics and learning. 


METHODS SUMMARY 


YFP-H line mice'* expressing yellow fluorescent protein (YFP) in a small subset of 
cortical neurons were used in all the experiments. Mice of both sexes were trained 
with different motor-skill tasks or housed in a motor-enriched environment, 
starting at 1 month of age (see Methods). The procedures for transcranial two- 
photon imaging and quantification of spine dynamics have been described previ- 
ously'?'*. ImageJ was used to measure spine head size, as well as inter-spine 
distances. Simulation was performed with custom-written codes in Matlab 
(MathWorks) and statistical analyses were performed using GraphPad Prism 5 
(GraphPad Software) (see Methods). All data were presented as mean + s.e.m. 
P values were calculated using the Mann-Whitney U-test for independent 
samples, and the Wilcoxon signed-rank test for paired samples. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 


Experimental animals. Thy1-YFP-H line mice’ were obtained from the Jackson 
Laboratory. Mice were group-housed and bred in the University of California, 
Santa Cruz, animal facility, with all experiments performed in accordance with 
approved animal protocols. 

Motor skill training and motor enrichment. Both the mouse single-seed reaching 
task and capellini-handling task protocols have been previously described'». ‘Motor- 
enriched’ mice were reared in groups of 8-12 in large cages (90 cm 25cm X 15cm) 
containing various toys, such as ropes, ladders, chains, hanging mesh/bars etc., all of 
which required substantial motor coordination. The nature of toys was changed on a 
daily basis. Control mice were housed in standard mouse cages, with up to five mice 
per cage. 

Surgical procedure for in vivo transcranial imaging. The procedure for 
transcranial two-photon imaging has been described previously'’*'. Trained mice 
were imaged immediately after each training session. 

Data quantification. All analyses of spine dynamics were done using ImageJ 
software, blinded for experimental conditions. Quantification criteria of dendritic 
spines have been described previously'®. All dendritic protrusions were tracked 
manually in three-dimensional stacks to ensure the consistency of protrusion iden- 
tification across imaging sessions, despite possible tissue movement or rotation. The 
number and location of dendritic protrusions (defined as protrusion length larger 
than one-third of dendritic shaft diameter) were identified in each view. Filopodia 
were identified as long, thin structures with the ratio of head diameter to neck 
diameter being less than 1.2 and the ratio of length to neck diameter being greater 
than 3. The remaining protrusions were classified as spines. Formation and 
elimination of spines and filopodia were determined by comparing images collected 
at two different time points. Spines or filopodia were considered identical between 
the two images if they were within 0.7 tum of their expected positions, based on their 
spatial relationship to adjacent landmarks and/or their positions relative to 
immediately adjacent spines. A stable spine was defined as a spine that was present 
in both images. A new spine was a spine that appeared in a subsequent image but 
was absent from the initial image. Percentages of formed and eliminated spines (or 
dendritic protrusions) were normalized to the number of spines (or dendritic 
protrusions) in the initial image. Spine diameter analyses have been previously 
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described’. Because imaging and animal conditions varied over time, the ratio of 
the spine head diameter to the adjacent dendritic shaft diameter was used as the 
normalized spine head diameter. Measurement of spine head intensity, as described 
previously’, was also performed to confirm these spine size results. Briefly, we 
determined the signal intensity (defined as the sum intensity of all pixels composing 
the spine in the best focal plane) and subtracted the background intensity (defined 
as the sum intensity of a region composed of the same number of pixels as the spine 
but with no YFP-labelled structure). The difference was then divided by the mean 
intensity of the adjacent dendritic shaft (defined similarly as the difference between 
the mean signal intensity of the shaft and the mean background intensity) to correct 
for varying imaging conditions. The final value was termed ‘integrated spine 
brightness.’ All distance measurements were done in ImageJ. To simulate spine 
formation, we first obtained the relative location of stable spines by measuring 
inter-spine distances along traced dendrites in seven control animals, and conca- 
tenated dendritic segments from each animal into a single ‘synthetic dendrite.’ We 
then used custom-written Matlab codes to simulate the addition of new spines. As 
we observed, two spines can extend from the same linear location along the 
dendritic segment and point towards different directions, given the cylindrical 
shape of dendrites. In our analysis and simulation, we made the simplifying 
approximation that the dendritic segment is one-dimensional rather than a tube. 
Therefore, zero inter-spine distance in our analysis represents two spines over- 
lapping in linear position but actually located at different sites around the circum- 
ference of the dendritic segment. In each round of simulation, the same numbers of 
new spines as observed in experiments were generated independently and uni- 
formly along synthetic dendrites. The distance between each new spine and its 
nearest stable spine (D,_,) was calculated. The simulation was repeated 1,000 times 
and the resultant data were pooled to compute the simulated sample median and 
the cumulative probability curve. All data were presented as mean + standard error 
of mean (s.e.m.). P values were calculated using the Mann-Whitney U-test for 
independent samples, and the Wilcoxon signed-rank test for paired samples. 
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Circadian rhythms govern cardiac repolarization 


and arrhythmogenesis 
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Sudden cardiac death exhibits diurnal variation in both acquired 
and hereditary forms of heart disease’’, but the molecular basis of 
this variation is unknown. A common mechanism that underlies 
susceptibility to ventricular arrhythmias is abnormalities in the 
duration (for example, short or long QT syndromes and heart 
failure)*° or pattern (for example, Brugada’s syndrome)° of myo- 
cardial repolarization. Here we provide molecular evidence that 
links circadian rhythms to vulnerability in ventricular arrhythmias 
in mice. Specifically, we show that cardiac ion-channel expression 
and QT-interval duration (an index of myocardial repolarization) 
exhibit endogenous circadian rhythmicity under the control of a 
clock-dependent oscillator, kriippel-like factor 15 (KIf15). KIf15 
transcriptionally controls rhythmic expression of Kv channel- 
interacting protein 2 (KChIP2), a critical subunit required for 
generating the transient outward potassium current’. Deficiency or 
excess of KIf15 causes loss of rhythmic QT variation, abnormal repo- 
larization and enhanced susceptibility to ventricular arrhythmias. 
These findings identify circadian transcription of ion channels as 
a mechanism for cardiac arrhythmogenesis. 

Sudden cardiac death from ventricular arrhythmias is the principal 
cause of mortality from heart disease worldwide and remains a major 
unresolved public health problem. The incidence of sudden cardiac 
death exhibits diurnal variation in both acquired and hereditary forms 
of heart disease’”. In the general population, the occurrence of sudden 
cardiac death increases sharply within a few hours of rising in the 
morning, anda second peak is evident in the evening hours’. In specific 
hereditary disorders, for example, Brugada’s syndrome, fatal ventricular 
arrhythmias often occur during sleep’. A common mechanism in both 
acquired and hereditary forms of heart disease that enhances suscept- 
ibility to ventricular arrhythmias is abnormal myocardial repolariza- 
tion’. Clinically, three common types of alterations in myocardial 
repolarization are evident on the surface electrocardiogram (ECG). 
First, prolongation of repolarization is seen in acquired disorders 
(for example, heart failure)’ and congenital disorders (for example, 
long QT syndrome)’. Second, shortening of repolarization is found 
in the short QT syndrome’. Third, early repolarization is the hallmark 
ECG finding in Brugada’s syndrome’. Interestingly, all three modifica- 
tions of repolarization increase vulnerability to ventricular arrhythmias®. 
Despite rigorous investigation of the biophysical and structural char- 
acteristics of ion channels that control myocardial repolarization, the 
molecular basis for the diurnal variation in occurrence of ventricular 
arrhythmias remains unknown. 

Biological processes in living organisms that oscillate with a periodicity 
of 24 h are said to be circadian. This cell-autonomous rhythm is coor- 
dinated by an endless negative transcriptional-translational feedback 


loop, commonly referred to as the biological clock’. Several physio- 
logical parameters in the cardiovascular system such as heart rate, 
blood pressure, vascular tone, QT interval and ventricular effective 
refractory period exhibit diurnal variation’? ’*. Recent studies have also 
identified a direct role for the biological clock in regulating cardiac 
metabolism, growth and response to injury'*. Previous studies have 
also reported that expression of repolarizing ion channels and ionic 
currents (J,.) exhibit diurnal changes’’. However, a potential link 
between circadian rhythms and arrhythmogenesis remains unknown. 
We made the serendipitous observation that K/f15 expression exhibits 
endogenous circadian rhythmicity in the heart (Fig. 1a). Gene expres- 
sion microarrays in hearts of mice that are deficient in K/f15 led us to 
identify KChIP2 (also called KCNIP2), the regulatory B-subunit for the 
repolarizing transient outward potassium current (J,,) as a putative 
target for this factor in the heart. These observations led us to question 
whether the circadian clock may regulate rhythmic variation in repo- 
larization and alter susceptibility to arrhythmias through KI/f15. 

First, we explored mechanisms through which the circadian clock 
regulated rhythmic expression of K/f15 in the heart. Examination of 
approximately 5kb of the promoter region of K/f15 revealed four 
canonical “E-box’ regions, that is, consensus binding sites for CLOCK 
and its heterodimer BMALI (also called ARNTL), which are essential 
transcription factors involved in the circadian clock (Supplementary 
Fig. la, inset). Consistent with this finding, KIf15 luciferase (approxi- 
mately 5 kb) was activated in a dose-dependent manner by the 
CLOCK-BMALI heterodimer (Supplementary Fig. 1a). To confirm 
this interaction, we performed chromatin immunoprecipitation 
(ChIP) and identified rhythmic variation in BMAL1 binding to the 
KIf15 promoter in the hearts of wild-type mice, but not in the hearts of 
BMALI-null mice (Fig. 1b). In accordance with the observations 
above, the expression of KIf15 was disrupted in Bmall-null, and 
Per2- and Cry1-null hearts (Supplementary Fig. 1b). Thus, our data 
strongly suggest that the circadian clock directly regulates the oscil- 
lation of KIf15 in the heart. 

To determine whether myocardial repolarization and ion-channel 
expression exhibit ‘true’ (endogenous) circadian rhythms—that is, 
oscillate in the absence of external cues such as light—wild-type mice 
were placed in constant darkness for 36h and telemetry-based ECG 
intervals were measured every 2 h for 24h. Under these conditions, the 
heart rate and the QT interval corrected to heart rate (QTc) were both 
rhythmic and exhibited true endogenous circadian rhythmicity 
(Fig. 1c, d). Next, to examine whether expression of repolarizing ion 
channels had endogenous circadian rhythms, mice were placed in 
constant darkness for 36 h, and hearts were collected every 4 h over 
a 24-h period. The expression of the o-subunit for the transient outward 
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Figure 1 | KIf15 expression, ECG QTc interval 
and expression of repolarizing ion channels 
exhibit endogenous circadian rhythm. a, K/f15 
expression exhibits endogenous circadian variation 
in wild-type (WT) hearts from mice in constant 
darkness (1 = 4 per time point). CT, circadian 
time. b, Effect of BMAL1 ChIP on the K/f15 
promoter, showing rhythmic variation in binding 
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endogenous circadian variation in constant 
darkness (n = 4). d, Representative ECGs from 
conscious mice after 36 h in constant darkness at 
CT 0 and CT 12. e, f, Endogenous circadian 
variation in transcripts for Kend2 and KChIP2 in 
wild-type hearts measured every 4 h after 36 h in 
constant darkness (n = 4 per time point). Error 
bars, mean + s.e.m. 
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potassium current (J,,), Kv4.2 (encoded by Kend2) (Fig. le), and the 
regulatory B-subunit, KChIP2 (Fig. 1f), exhibit endogenous circadian 
rhythmicity, as did components of the circadian clock in the heart 
(Supplementary Fig. 2). In contrast, the expression of two other major 
repolarizing currents in the murine ventricle, Kv1.5 (the «-subunit for 
the ultra-rapid delayed rectifier potassium current) and Kir2.1 (the 
o-subunit for the inward rectifier potassium current), did not reveal 
notable rhythmic variation (Supplementary Fig. 3). In addition, we 
observed a 24-h rhythm in the oscillation of Bmall, KIf15 and 
KChIP2 after serum shock in cultured neonatal rat ventricular myocytes 
(Supplementary Fig. 4). These data indicate that myocardial repolariza- 
tion and the expression of some repolarizing ion channels exhibit an 
endogenous circadian rhythm. 


Relative fold 


Next, to elucidate the role of K/f15 in regulating rhythmic changes 
in repolarization, we used complementary in vivo loss- and gain-of- 
function approaches in mice. For loss-of-function, a previously 
described systemic K/f15-null mouse was used"*; for gain-of-function, 
a cardiac-specific KIf15 transgenic (KIf15-Tg) mouse driven by an 
attenuated «-myosin heavy chain (%-MHC) promoter was developed 
(Supplementary Fig. 5). First, we examined whether rhythmic expres- 
sion of Kcnd2 or KChIP2 was altered in the KIf15-deficient state. Kcnd2 
expression exhibited altered rhythmic variation in KIf15-null mice 
with reduced expression at zeitgeber time 6 (ZT6), and increased 
expression at ZT22 compared to wild-type controls (Fig. 2a). KChIP2 
expression was devoid of any discernable rhythm in the K/f15-null mice 
and sustained reduction was observed at all time points (Fig. 2b, c and 


Figure 2 | KIf15 regulates KChIP2 expression in 
the heart. a, Kcnd2 mRNA expression exhibits 
diurnal rhythm in wild-type mice (P = 0.0023), but 
in K/f15-null hearts (P not significant) the rhythm 
is abnormal with reduced expression at zeitgeber 
time 6 (ZT6) and increased expression at ZT22 (n 
= 4 per time point per group). b, KChIP2 mRNA 
expression exhibits no rhythmic variation in K/f15- 
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Supplementary Fig. 6a). Next, we examined whether Kend2 or KChIP2 
serve as transcriptional targets for K/f15 in the heart. Adenoviral over- 
expression of KIf15 in neonatal rat ventricular myocytes robustly 
induced KChIP2 expression but had no effect on Kend2 expression 
(Supplementary Fig. 6b). Notably, in KIf15-Tg hearts, expression of 
KChIP2 was twofold greater but with no effect on Kcnd2 expression 
(Fig. 2d, e). Examination of the KChIP2 promoter region revealed 
numerous consensus kriippel-binding sites, that is, C(A/T)CCC (Sup- 
plementary Fig. 7a). The activity of KChIP2 luciferase was induced by 
full-length KLF15 but not by a mutant that lacked the zinc-finger 
DNA-binding domain (Supplementary Fig. 7b). To identify the specific 
KIf15 binding site, deletion constructs of the KChIP2 promoter were 
generated, and transcriptional activity was mapped to the proximal 
555 bases (Supplementary Fig. 7a). Mutation of one kriippel-binding 
site within this region (A1) was sufficient to cause complete loss of 
activity in the full-length KChIP2 promoter (Supplementary Fig. 7c). 
Chromatin immunoprecipitation of Flag-KLF15 from K/f15-Tg hearts 
confirmed that KLF15 was enriched on the endogenous KChIP2 pro- 
moter (Fig. 2f). Importantly, the oscillation of several components of 
the core clock machinery was minimally affected in the KIf15-deficient 
state (Supplementary Fig. 8). In addition, the expression levels of clock 
genes in KIf15-Tg hearts were similar to their controls at ZT6 
(Supplementary Fig. 8). This suggested that the endogenous clock is 
dependent on KI/f15 to orchestrate rhythmic changes in KChIP2 
expression. Consistent with this observation, the expression of K/f15 
(Supplementary Fig. 1b) and KChIP2 (Supplementary Fig. 9) were 
altered in a similar fashion in Bmall-null, and Per2- and Cry1-null 
mice. These data support the idea that KChIP2 is a direct transcrip- 
tional target for K/f15 in the heart. 

We next examined whether K/f15-dependent regulation of KChIP2 
could be responsible for rhythmic day-night variation in myocardial 
repolarization. Analysis of telemetry-based ECGs revealed that rhythmic 
QTc interval variation was indeed abrogated in both K/f15-null and 
KIf15-Tg mice (Fig. 3a—d). In the K/f15-deficient state, the ECG QTc 
interval was prolonged in the dark phase and failed to oscillate (Fig. 3a, 
c). This occurred despite KIf15-null mice having similar heart rates to 
their wild-type counterparts (Supplementary Fig. 11). In contrast, the 
KIf15-Tg mice had persistently short QT intervals with no rhythmic 
day-night variation (Fig. 3b, d). Again, this occurred despite minimal 
difference in heart rates when compared to wild-type controls (Sup- 
plementary Fig. 11). Next, we examined whether transient outward 
current (J;, fast)-dependent changes in repolarization in isolated 
myocytes were responsible for the ECG changes mentioned above in 
KIf15-null and KIf15-Tg mice. In KIf15-null mice, there was a marked 
reduction in I,, gt density (Fig. 3e) and prolongation of action potential 
duration (APD) (Fig. 3g). In contrast, KIf15-Tg mice exhibited a sub- 
stantial increase in I, fast density (Fig. 3f) with a dramatic shortening of 
APD (Fig. 3h). In the K/f15-Tg mice, in addition to short QT intervals, 
we observed ST-segment changes indicative of early repolarization that 
are similar to ECG findings in Brugada’s syndrome’ (Fig. 3b, arrows). 
Our data suggest that K/f15-dependent transcriptional regulation of 
rhythmic KChIP2 expression in murine hearts plays a central part in 
rhythmic variation in ventricular repolarization. 

Next, we examined whether excessive prolongation or shortening of 
repolarization could alter arrhythmia susceptibility and survival. KIf15- 
null mice show no spontaneous arrhythmias on ECG telemetry, hence 
we used intracardiac programmed electrical stimulation to examine 
arrhythmia susceptibility. In contrast to wild-type mice, a marked 
increase in occurrence of ventricular arrhythmias was seen in KIf15- 
null mice (Fig. 4a). Notably, KIf15-Tg mice exhibit spontaneous 
ventricular arrhythmias on ECG telemetry (Fig. 4b) and succumb to 
~35% mortality by 4 months of age (three out of eight deaths in K/f15- 
Tg versus no deaths out of eight in wild-type non-transgenic controls, 
data not shown). As the K/f15-null mice show no evidence of overt 
ventricular dysfunction, apoptosis or fibrosis'®’” in the basal state, the 
enhanced susceptibility to arrhythmias is probably primarily driven by 
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Figure 3 | Deficiency or excess of KIf15 modulates rhythmic variation in 
repolarization. a, b, Representative ECGs from wild-type versus K/f15-null 
mice, and wild-type (non-Tg) versus K/f15-Tg mice at ZT2 and ZT14. Note the 
ST-segment abnormalities in KIfI5-Tg mice (arrows). c, QTc interval exhibits 
24-h rhythm in wild-type mice; this rhythm is abrogated with prolonged QTc in 
the dark phase in K/f15-null mice (n = 4 for wild type, n = 4 for K/f15-null). 
d, KIf15-Tg mice exhibit persistently short QT intervals with no day-night 
rhythmic variation compared to wild-type (non-Tg) controls (n = 3 for wild 
type, n = 4 for KIf15-Tg). e, f, Representative outward current recordings from 
all study groups and summary data for the amplitude of I,, fs: measured at 
60 mV with an average time of decay of 45 + 5 ms (n = 10 for wild type, n = 13 
for Kif15-null; n = 14 for wild type (non-Tg), and n = 19 for KIf15-Tg). 

g, h, Representative ventricular action potentials from all study groups with 
summary data in bar graphs (n = 10 for wild type, n = 13 for K/f15-null; n = 14 
for wild type (non-Tg), and n = 19 K/f15-Tg). Error bars, mean + s.e.m., 

*P < 0.05. APDgo, action potential duration measured at 90% repolarization. 
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Figure 4 | KIf15 deficiency or excess increases susceptibility to ventricular 
arrhythmias. a, Programmed electrical stimulation in wild-type and K/f15-null 
mice. Onset of ventricular tachycardia after premature stimuli is shown 
(arrows) in K/f15-null mice (none of the seven wild-type mice were inducible 
but three of the four K/f15-null mice were inducible; *P < 0.05). b, Spontaneous 
ventricular arrhythmia in KIf15-Tg mice. (none of the four wild-type mice 
exhibited spontaneous arrhythmias but three of the four K/f15-Tg mice 
exhibited ventricular arrhythmias; *P < 0.05). VT, ventricular tachycardia. 


abnormalities in repolarization. Our studies demonstrate that both 
deficiency and excess of KIf15 impair temporal variation in cardiac 
repolarization and greatly increase susceptibility to arrhythmias. 

Although our finding of circadian control of KChIP2 by KIf15 
establishes the principle that circadian rhythms may contribute to 
arrhythmogenesis, we note that K/f15 minimally affects Kcnd2 expres- 
sion that also exhibits circadian rhythm (Fig. 1f). However, Kend2 
expression was disrupted in Bmall-null and Per2- and Cryl-null 
hearts, and this is indicative of a direct regulation by the circadian clock 
(Supplementary Fig. 12). Consistent with this observation, cardio- 
myocytes from Bmall-null mice exhibit marked action potential pro- 
longation due to near-complete elimination of the fast component of 
the transient outward potassium current (Supplementary Fig. 13). This 
raises the possibility that additional factors—perhaps components of 
the circadian clock or unidentified transcriptional regulators—may 
also affect temporal variation in electrophysiological parameters and 
arrhythmogenesis. Future studies in cardiac-specific deletion of clock 
components would be necessary to confirm whether the ion channel 
rhythms are cell autonomous, and their role in regulating cardiac 
electrophysiology. 

Our study provides the first mechanistic link between endogenous 
circadian rhythms and the cardiac electrical instability that is most 
often associated with sudden cardiac death in humans (Supplemen- 
tary Fig. 14). Specifically, we show that K/f15-dependent rhythmic 
transcription of KChIP2 regulates the duration and pattern of repolar- 
ization and susceptibility to arrhythmias in mice. As the occurrence of 
sudden cardiac death in acquired and hereditary forms of human heart 
disease follows a distinct diurnal pattern’”, these observations offer 
new insights into unrecognized triggers of electrical instability in the 
heart. However, in contrast to murine repolarization, which is largely 
dependent on J,,, human repolarization occurs through a complex 
interaction of multiple repolarizing ionic currents. Thus, additional 
studies will be needed to develop a comprehensive understanding of 
the link between the circadian clock and electrophysiological properties 
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of the human heart. Nevertheless, these data may provide a mechanistic 
foundation for future efforts to prevent or treat cardiac arrhythmias by 
modulating the circadian clock through behavioural or pharmaco- 
logical means. 


METHODS SUMMARY 


Mice used in the present study, messenger RNA quantification using polymerase 
chain reaction with reverse transcription (RT-PCR), promoter reporter analysis, 
western immunoblot analysis, chromatin immunoprecipitation, telemetry ECG 
and interval analysis, isolated myocyte studies for action potential or I,, measure- 
ments, in vivo electrophysiological studies for arrhythmia susceptibility, cosinor 
analysis for rhythm assessment, and statistical methods are detailed in the 
Methods. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 


Mice. All animal studies were carried out with permission, and in accordance with, 
animal care guidelines from the Institutional Animal Care Use Committee 
(IACUC) at Case Western Reserve University and at collaborating facilities. 
Wild-type male mice on C57BL6/J background (Jackson Laboratory) were bred 
in our facility and used for circadian studies. Mice were housed under strict light- 
dark conditions (lights on at 6:00 and lights off at 18:00) and had free access to 
standard chow and water, and were minimally disturbed for 4-6 weeks before the 
final experiment. Generation of systemic K/f15-null mice was as described previ- 
ously’*. K/f15-null mice have been backcrossed into the C57BL6/J background for 
over ten generations'* and the BMAL1 mice were bred as previously described”. 
For K/f15-Tg mice, Flag-KLF15 was cloned downstream of an attenuated ¢-myosin 
heavy-chain promoter as previously described”’. This construct was injected into 
FVB (friend leukemia virus B mouse strain) oocytes, and after germline trans- 
mission the mice were examined for expression of the transgene. Wild-type 
(non-Tg) littermates served as controls. For light-dark experiments, mice were 
killed with CO, inhalation or isoflurane every 4h for 24h. For constant dark 
experiments, mice were placed in complete darkness for 36 h (starting at the end 
of light phase at ZT12) and hearts were collected every 4h over a 24-h period. 
RNA isolation and RT-PCR analysis: After euthanasia, hearts were collected, 
washed in cold phosphate buffered saline, the atria removed and the ventricles 
dissected to the apical and basal regions, and flash frozen in liquid nitrogen. 
RNA was isolated from the apical regions of frozen heart samples by homogeniza- 
tion in Trizol reagent (Invitrogen) by following the manufacturer’s instructions 
(Invitrogen). RNA was reverse transcribed after DNase treatment (New England 
Biolabs). RT-PCR was performed using locked nucleic acid (LNA)-based TaqMan 
approach with primers and probes designed, and their efficiency tested, at the 
Universal Probe Library (Roche), and with B-actin used as the normalizing gene. 
Cell-culture studies. Neonatal rat ventricular myocytes were isolated from 1-2- 
day-old rat pups and grown under standard conditions'*. Adenoviral overexpres- 
sion was performed for 24h and myocytes were then collected for mRNA and 
protein analysis. For synchronization, the myocytes were starved in media con- 
taining insulin, transferrin and selenium (ITS supplement, Sigma-Aldrich) for 
48h. After this, the myocytes were synchronized with 50% horse serum for 
30 min, washed twice with no-serum media and replenished with ITS-containing 
media. The mouse K/f15 promoter (approximately 5 kb) was cloned into PGL3- 
basic (Promega). The rat KChIP2 luciferase was a gift from P. H. Backx. Mutant 
constructs of rat KChIP2 luciferase were generated by PCR-based TOPO cloning 
(Invitrogen), and site-directed mutagenesis was performed using Quikchange II 
mutagenesis kit (Agilent Technologies) and confirmed by sequencing. KIf15 and 
KCAhIP2 luciferase studies were conducted in NIH3T3 cells, and luciferase activity 
was normalized to protein concentration. 

Western immunoblot analysis. For detecting Flag~KLF15, nuclear lysates were 
prepared using the NE-PER kit following manufacturer’s instructions (Thermo 
Scientific) and probed with anti-Flag antibody (Sigma). For KChIP2 analysis, 
whole-cell lysates were prepared by homogenizing the basal regions of the hearts 
in buffer containing Tris-HCl (50mM, pH7.4), NaCl (150mM), NP-40 (1%), 
sodium deoxycholate (0.25%), EDTA (1 mM), and supplemented with protease 
and phosphatase inhibitors (Roche). The blots were probed with a mouse 
monoclonal antibody against KChIP2 (NIH Neuromab), normalized to tubulin 
(Sigma-Aldrich) and quantified using Quantity One software (Bio-Rad). 

ChIP. ChIP was performed with hearts as previously described*'”’. In brief, hearts 
were fixed with fresh 1.11% formaldehyde for 10 min, and then by chromatin 
preparation and sonication (Diagenode). The sonicated chromatin was immuno- 
precipitated using BMAL1 or Flag antibody bound to Dynabeads (Invitrogen). 
The relative abundance was normalized to abundance of 28S between the input 
and immunoprecipitated samples as previously described*'. Primers that were 
used for BMAL1 ChIP on the K/f15 promoter were; forward, 5’-GCCTG 
AGCATCCTCCCCATCA-3’; reverse, 5'-GGGGCCACCTCTCTGGACTT-3’; 
and probe, 5'FAM-CCCGCCCAGTGACCATGTCTGCCTGT-3'BHQ1. Non- 
target primers were; forward, 5'-GCCAATTCACATTTCAACCA-3’; reverse, 
5'-GACACAAGGCATTTCAA-3’; and probe, 5’FAM-TGCAAAGGGCTGGA 
CATGGG-3'BHQI. Primers that were used for ChIP of Flag-KLF15 on the 
KChIP2 promoter were; forward, 5'-GCTCCGCTCTCACTTGCT-3’; and 
reverse, 5'-GGCTGGCAAGGCTTTTCT-3’. 

Telemetry ECG and interval analysis. Mice were implanted with telemetry 
devices (ETA F20, Data Sciences International) and allowed to recover for at least 
2 weeks. ECGs were recorded from conscious mice continuously in their native 
environment and digital data (PhysioTel, Data Sciences International) were stored 
for future analysis. Owing to rapid changes in the mouse heart rates, a weighted 
heart-rate approach was used to assess rhythmic changes in QT interval, and 
measurements were made every 2h over a 24-h period. First, the average heart 
rate was calculated for each hour by digital tracking of the ECG RR intervals (time 
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interval between two consecutive R waves) using the Dataquest analysis software 
(Data Sciences International). Then, during the first instance within each hour 
when the average heart rate was present, the QT interval was measured using 
electronic calipers from two consecutive beats. The QT interval was corrected 
for heart rate using a previously validated formula for conscious mice QT/(RR/ 
100)'” (ref. 23). A Cosinor model was applied to assess the 24-h rhythm in QT 
using a sinusoidal regression function and raw data presented in four hourly 
blocks for visualization purposes. 

Electrophysiological studies in myocytes. Murine ventricular myocytes were 
isolated using a standard enzymatic dispersion technique following overnight fast 
as previously described**. Myocytes were re-suspended in media 199, allowed to 
recover and recordings were conducted within several hours on the same day. The 
conventional whole-cell mode was used to record action potentials and Ito. In brief, 
myocytes were bathed in a chamber that was continuously perfused with Tyrode’s 
solution of the following composition (in mmoll!); NaCl, 137; KCl, 5.4; CaCh, 
2.0; MgSOy, 1.0; glucose, 10; and HEPES, 10 (pH 7.35). Patch pipettes (0.9-1.5 MQ) 
were filled with electrode solution composed of (in mmol17!): aspartic acid, 120; 
KCl, 20; NaCl, 10; MgCl), 2; and HEPES, 5 (pH 7.3). Action potentials were elicited 
in current-clamp mode by injection of a square pulse of current of 5 ms duration 
and 1.5-2 times the threshold amplitude. APD was measured at 90% repolariza- 
tion. To measure [,,, cells were placed in Tyrode’s solution (as described earlier) 
containing 1 1M nisoldipine to block calcium current and calcium-activated chlor- 
ide current, and tetrodotoxin (100 tmol1~') to block sodium current. Cells were 
brought from a holding potential of -70 mV to -25 mV for 25 ms. To isolate the 
fast, transient component of the outward currents, Ii, fs» the decay phase of 
outward potassium currents was fit by the exponential functions of the form: 


y(t) =A, exp(—t/t1) + Az exp(—t/t2) + Acs 


where 7 is the time constant of decay of the fast, transient component of outward 
potassium currents; A, is the amplitude coefficient of Ii, fast; Tz is the time constant 
of decay of the slow, transient component of the outward currents; A; is the 
amplitude of Ito slow; and Ags is the amplitude coefficient of the non-inactivating 
steady-state outward potassium current J,, Consistent with previous studies”, the 
time constant of decay of the fast, transient component Ito fast WaS 46 + 5 ms. The 
measured current amplitudes were normalized to cell capacitance and converted 
into current densities. All experiments were conducted at 36 °C. Cell capacitance 
and series resistance were compensated electronically at ~80%. Command and 
data acquisition were operated with an Axopatch 200B patch-clamp amplifier 
controlled by a personal computer using a Digidata 1200 acquisition board driven 
by pCLAMP 7.0 software (Axon Instruments). 

Programmed electrical stimulation. Intracardiac programmed electrical stimu- 
lation was performed as previously described”’. In brief, mice were anaesthetized 
using 1.5% isoflurane in 95% O, after an overnight fast. ECG channels were 
amplified (0.1 mV cm” ') and filtered between 0.05 and 400 Hz. A computer-based 
data acquisition system (Emka Technologies) was used to record a 3-lead body 
surface ECG, and up to four intracardiac bipolar electrograms. Bipolar right atrial 
pacing and right ventricular pacing were performed using 2-ms current pulses 
delivered by an external stimulator (STG-3008, MultiChannel Systems; 
Reutlingen). Standard clinical electrophysiologic pacing protocols were used to 
determine all basic electrophysiologic parameters. Overdrive pacing, single, 
double and triple extrastimuli, as well as ventricular burst pacing, were delivered 
to determine the inducibility of ventricular arrhythmias, which was tested twice. 
Statistical analysis. A cosinor model was adopted to determine whether there is a 
substantial 24-h rhythm in each physiological and molecular variable of interest. 
By pooling data points of all mice, the model fits data to a fundamental sinusoidal 
function”’. To determine the coefficients (amplitude and phase) of the sinusoidal 
function and to see whether there were significant relationships, a mixed model 
analysis of variance was performed using standard least-square regression and the 
restricted maximum likelihood method (JMP 8.0, SAS Institute) as previously 
described**. Data are presented as mean + s.e.m., the Student’s ¢-test was used 
for assessing the difference between individual groups and P = 0.05 was consid- 
ered statistically significant. 
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chromothripsis and defects in neuritogenesis genes 


Jan J. Molenaar'*, Jan Koster'*, Danny A. Zwijnenburg’, Peter van Sluis', Linda J. Valentijn', Ida van der Ploeg', Mohamed Hamdi’, 
Johan van Nes!, Bart A. Westerman!, Jennemiek van Arkel', Marli E. Ebus', Franciska Haneveld!, Arjan Lakeman’, Linda Schild!, 
Piet Molenaar!, Peter Stroeken!, Max M. van Noesel”, Ingrid Ora’, Evan E. Santo!, Huib N. Caron’, Ellen M. Westerhout! 


& Rogier Versteeg! 


Neuroblastoma is a childhood tumour of the peripheral sympathetic 
nervous system. The pathogenesis has for a long time been quite 
enigmatic, as only very few gene defects were identified in this often 
lethal tumour’. Frequently detected gene alterations are limited to 
MYCN amplification (20%) and ALK activations (7%)*°. Here we 
present a whole-genome sequence analysis of 87 neuroblastoma of 
all stages. Few recurrent amino-acid-changing mutations were 
found. In contrast, analysis of structural defects identified a local 
shredding of chromosomes, known as chromothripsis, in 18% of 
high-stage neuroblastoma®. These tumours are associated with a 
poor outcome. Structural alterations recurrently affected ODZ3, 
PTPRD and CSMD1, which are involved in neuronal growth cone 
stabilization’ ’. In addition, ATRX, TIAM1 and a series of regula- 
tors of the Rac/Rho pathway were mutated, further implicating 
defects in neuritogenesis in neuroblastoma. Most tumours with 
defects in these genes were aggressive high-stage neuroblastomas, 
but did not carry MYCN amplifications. The genomic landscape of 
neuroblastoma therefore reveals two novel molecular defects, chro- 
mothripsis and neuritogenesis gene alterations, which frequently 
occur in high-risk tumours. 

Neuroblastoma have a highly variable clinical outcome, with an 
excellent prognosis for stage 1 and 2 tumours, but a poor outcome 
for high-stage tumours. Stage 4S neuroblastoma are metastasized but 
nevertheless undergo spontaneous regression. Low-stage tumours are 
marked by numeric changes of chromosomal copy numbers, whereas 
high-stage tumours typically show structural chromosomal defects 
resulting in, for example, hemizygous deletions of the chromosomal 
regions 1p36 or 11q and gain of 17q (refs 1, 10-12). Age at diagnosis 
above 1.5 year is associated with high-stage tumours and poor outcome. 

We performed whole-genome paired-end sequencing as used by 
Complete Genomics’? for 87 untreated primary neuroblastoma 
tumours of all stages (Supplementary Table 1) and their corresponding 
lymphocyte DNAs. All samples had a minimal tumour content of 80% 
as determined by immunohistochemical analysis. Genomes were 
sequenced at an average coverage of 50 and an average fully called 
genome fraction of 96.6% (Supplementary Table 2). Compared to the 
HGI18 reference genome we obtained an average of 3,347,592 single- 
nucleotide variants (SNVs) per genome, in accordance with reported 
frequencies of interpersonal variants. CGAtools was used to compare 
tumour with lymphocyte genomes and provided a somatic score 
estimating the likelihood of mutations to be somatic (http://cgatools. 
sourceforge.net/docs/1.3.0/). Validation of 1,014 candidate somatic 
small mutations (SNVs, substitutions, insertions, deletions), including 
763 SNVs, established a specificity of 88% and a sensitivity of 85% at a 
somatic score cut-off of 0.1 (Supplementary Fig. 1a). SNVs above this 
score and all validated SNVs with lower scores were used for further 


analyses (total 586 genes, Supplementary Table 3). The sequence data 
identified an average of 12 somatic candidate amino-acid-affecting 
mutations per tumour (Fig. la and Supplementary Fig. 1b). The fre- 
quency of somatic events strongly correlated to tumour stage where 
stage 1, 2 and 4S tumours have very few mutations compared to stage 3 
and 4 tumours (analysis of variance (ANOVA) P=7.6 X 10 °; 
Fig. 1b). In addition mutation frequencies were strongly correlated 
to overall survival (log-rank P = 9.8 X 10’; Fig. 1c) and age at diagnosis 
(r=0.53, P=1.1X 10 7; Fig. 1d), as was also observed in medul- 
loblastoma™. Within high-stage neuroblastoma, MYCN amplification 
status did not correlate to mutation frequency (Supplementary Fig. 1c). 

Only very few recurrent mutations were identified. ALK mutations 
were found in 6% of the tumours, in accordance with frequencies estab- 
lished in large neuroblastoma tumour series (Supplementary Table 4)*>. 
Three tumours carried mutations in TIAM1, a known regulator of 
cytoskeleton organization and neuritogenesis’’. In a parallel study we 
sequenced four primary neuroblastoma tumours as well as cell lines 
derived from these tumours and their metastases. This revealed that 
primary tumours are already heterogeneous for mutations and that 
the large majority of them were passenger or late mutations (J.J.M. 
et al., submitted). Together with the lack of recurrent mutations, our 
data indicate that neuroblastoma carry few early somatic tumour- 
driving mutations with amino-acid-changing consequences. 

Analysis of the paired-end clones with discordant ends can be used 
to identify candidate structural rearrangements, which together with 
sequence coverage data can identify somatic structural variants (SVs). 
Comparison of tumour versus lymphocyte coverage generated ultra- 
high-resolution comparative genomic hybridization (CGH)-like profiles 
(Supplementary Fig. 2a). Analysis of the frequency of structural varia- 
tions per chromosome revealed ten tumours with chromothripsis 
characteristics® (see Methods). Chromothripsis is a localized shredding 
of a chromosomal region and subsequent random reassembly of the 
fragments. An extreme example of chromothripsis in chromosome 5 is 
shown in Fig. 2a and 2b (for other cases see Supplementary Fig. 2b). 
The neuroblastoma with chromothripsis were associated with a poor 
prognosis (log-rank test P= 7.1 10°; Fig. 2c). They were found 
in 18% of the stage 3 and 4 neuroblastoma, but not in low-stage 
tumours (Fisher’s exact test P= 0.01). Accordingly, their prognostic 
impact is not independent of age and stage in multivariate analyses. 
Chromothripsis-related structural aberrations frequently affected 
genes involved in neuroblastoma pathogenesis and were associated 
with amplification of MYCN or CDK4 and loss of heterozygosity of 
1p (Supplementary Fig. 2c). In one tumour, chromothripsis resulted in 
amplification and very strong overexpression of MYC (c-Myc) (Sup- 
plementary Fig. 2d). Chromosome 5 had undergone chromothripsis in 
three tumours, but no clear tumorigenic target on this chromosome 
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Figure 1 | Frequency of amino-acid-changing somatic mutations in 
neuroblastoma correlates with age, stage and survival. a, The number of 
amino-acid-changing mutations in 87 primary neuroblastoma (single 
nucleotide variants (SNVs) in red, deletions in grey, insertions in green and 
substitutions affecting more than 1 base pair (Sub) in blue). Numbers shown 
are events after CGAtools CallDiff with somatic scores >0.1 and not present in 
dbSNP130, nor in 46 reference genomes released by Complete Genomics. 

b, Average number of mutations per tumour stage (International 
Neuroblastoma Staging System (INSS) stage 1, n = 9; stage 2, n = 14; stage 3, 
n= 5; stage 4, n = 50 and stage 4S, n = 9). Boxes include 50% of data and error 
bars indicate extremes with a maximum of two times the box size. st., stage. 
c, Kaplan-Meier curves for tumours with high versus low frequency of 
mutations. The optimal cut-off level for the categories was determined by 
Kaplan scanning (see Supplementary Information and Methods). The 
significance (log-rank test) was corrected for the multiple testing (Bonferroni 
correction). Number of patients per group is shown in parentheses. d, Age at 
diagnosis (rank-order) versus the number of somatic variants. e, Average 
number of structural variations per tumour stage (INSS). Group sizes and the 
definition of the error bars as in Fig. 1b. 


was identified. To identify genetic defects that allowed chromothripsis 
and subsequent survival of the cell, we searched for defects in DNA 
damage response pathways in tumours with chromothripsis. The most 
extreme case of chromothripsis (N492, Fig. 2a and 2b) showed an 
inactivating deletion in FANCM and another chromothripsis tumour 
sample (N576) had a missense mutation in FAN1, predicted to be 
damaging by the polyphen2 program’. These findings might suggest 
involvement of inactivating events in the Fanconi anaemia signalling 
pathway to allow chromothripsis”’. 

Full genome paired-end sequencing allowed us to identify structural 
variants specifically perturbing single genes (see Methods and Sup- 
plementary Fig. 4 for selection procedure). We detected a total of 451 
genes harbouring structural variants (306 genes without the events on 
chromothripsis chromosomes, Supplementary Tables 5 and 6). The 
structural variants often consisted of deletions of one or a few exons, 
inversions or translocations deleting part ofa gene. One tumour showed 
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an intrachromosomal rearrangement activating FOXRI transcription 
(Supplementary Fig. 2e), which we recently identified as a recurrent but 
rare event in neuroblastoma’’. Similar to the findings for amino- 
acid-changing mutations, there was a strong relation between the fre- 
quency of structural variations and the tumour stage (one-way 
ANOVA P= 0.03; Fig. le), which extends the well-established rela- 
tionship between tumour stage and structural chromosomal defects in 
neuroblastoma’’’, Breakpoints identifying deletions were supported 
by changes in coverage plots. Most of the structural variants affected 
only one allele of a gene (Supplementary Table 5). This indicates that 
the tumour-driving mechanism of these defects is haploinsufficiency, 
possibly combined with epigenetic attenuation of the non-affected 
allele. On average, genes with structural variants resulting in loss of 
coverage indeed showed a reduced expression in tumours with these 
defects, as compared to tumours with normal alleles (Supplementary 
Fig. 2f). As an additional validation, we generated SNP arrays of 52 of 
the sequenced tumours. Although the SNP data have a much lower 
resolution than the sequence coverage plots, they supported the dele- 
tions and gains of sufficient size. This is especially evident on plots of 
chromothripsis samples (Supplementary Fig. 2g). 

To identify relevant genes and pathways that contribute to neuro- 
blastoma pathogenesis, we generated one list of all genes with amino- 
acid changing mutations (n = 586), mutations in splice junctions 
(n = 37) and structural variations (n = 451). The total of 1,041 genes 
with alterations were analysed by two approaches. First, we analysed 
the most frequently affected genes (Supplementary Table 7). Four 
genes belonged to the MYCN amplicons (MYCN, MYCNOS, DDX1 
and NBAS) and except for MYCN probably play no role in pathogenesis. 
Three genes, PTPRD, ODZ3 and ATRX, showed structural variants in 
five tumours each (Fig. 3a and Supplementary Fig. 3a) and 61 genes 
showed alterations in two to four tumours (Supplementary Table 7). A 
conservative randomization, taking the length of all genes and the 
structure of our data set into account, showed that the chance of 
finding three genes affected in five or more tumours is <2.11 x 10 * 
(see Methods). This strongly indicates that at least the defects in 
PTPRD, ATRX and ODZ3 did not accumulate due to for example, 
the genomic length of the genes, but that they were selected for during 
the process of tumorigenesis. 

The X-chromosome-encoded ATRX gene was affected by structural 
variants in five tumours (Fig. 3a). In two male patients this resulted in 
complete inactivation of the gene. Frequent ATRX defects were recently 
found in pancreatic neuroendocrine tumours”. ATRX is a chromatin 
remodelling protein involved in exchange of H3.3 in GC-rich repeats 
and mutations of this gene are associated with X-linked mental retarda- 
tion”. Exon mRNA profiles of part of the sequenced series showed that 
the three samples included with ATRX structural variations had the 
lowest ATRX mRNA expression of all samples and showed a specific 
collapse of the signal in the deleted regions, illustrating the inactivating 
nature of the ATRX defects (Fig. 3b and Supplementary Fig. 3b). 

ODZ3 and PTPRD were also hit by structural variations in five 
tumours each (Supplementary Fig. 3a). One tumour showed homo- 
zygous inactivation of ODZ3 (see legends of Supplementary Fig. 3a). 
In addition, ODZ2 and ODZ4, two highly homologous members of 
the conserved ODZ family, were together affected three times. PTPRD 
and ODZ genes encode transmembrane receptors expressed in the 
developing nervous system and localizing to axons and axonal growth 
cones’. Targeted silencing of ODZ homologues in Drosophila, 
Caenorhabditis elegans and mouse caused severe axon guidance 
defects’. Overexpression of ODZ2 in neuroblastoma cells enhanced 
neuritogenesis**. PTPRD is a member of the LAR subfamily of receptor 
protein tyrosine phosphatases. Transgenic mouse models strongly 
implicate the LAR subfamily receptors in neuritogenesis*. PTPRD 
defects in neuroblastoma were reported previously~*. Low expression 
of ODZ3 and of PIPRD as assessed by mRNA profiling were 
both associated with a poor prognosis (log-rank P = 3.1 X 10 * and 
P=5.7X10 °, respectively; Supplementary Fig. 3a). Interestingly, 
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Figure 2 | Chromothripsis is frequent in neuroblastoma and is associated 
with a poor prognosis. a, Circos plot showing structural variations in sample 
N492. The inner ring represents the copy number variations (red, gain; green, 
loss) based on coverage of the tumour and lymphocyte genomes. The lines 
traversing the ring indicate inter- and intrachromosomal rearrangements 
identified by discordant mate pairs from paired-end reads. N492 is a 
chromothripsis sample with an extreme amount of junctions on chromosome 
5.b, Circos plot of the affected chromosome 5 in sample N492. c, Kaplan-Meier 
curves of the overall survival for tumours with or without chromothripsis. 
Numbers of patients per group are shown between brackets. 
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CSMD1, which showed structural variants in three tumours, is also a 
transmembrane protein expressed on nerve growth cones’. As the 
frequencies of PTPRD and ODZ3 defects exclude that they were found 
by chance, we propose that the function of these genes and of ODZ2, 
ODZ4 and CSMD1 in neuronal growth cones might hold a clue to their 
function in neuroblastoma pathogenesis. 

The second analysis that we performed for the list for 1,041 affected 
genes was a gene ontology study to identify enrichment of genes with 
defects in specific molecular processes. The gene ontology category 
‘regulation of GTPase activity’ was the most significantly enriched 
group (Bonferroni corrected for multiple testing: P = 6.7 X 10 *; see 
supplementary methods and Supplementary Table 9). This finding 
urged us to further investigate GTPase-regulating genes in the list. 
TIAM1 was mutated in three tumours (see Supplementary Table 4). 
It functions as a guanine nucleotide exchange factor (GEF) for the 
small GTPase Rac and is, together with Rac, central to regulation of 
cellular polarity and neuritogenesis**. The W1285S* mutation 
creates a premature stop-codon in the carboxy-terminal pleckstrin 
homology domain required for Rac activation, whereas the other 
mutations were predicted to be damaging by polyphen2 analysis’®. 
Racis activated by GEFs and inactivated by GTPase activating proteins 
(GAPs)”® (Fig. 4). We identified a total of eight alterations in six GEFs 
specific for Rac (including TIAM1), but none in GAPs specific for Rac 
(Supplementary Tables 7, 8 and 10 for functional consequences). 
Whereas activation of Racl stimulates neuritogenesis, activation of 
its small GTPase antagonist RhoA promotes axon retraction and 
growth cone collapse (Fig. 4a)'°. Strikingly, we detected seven altera- 
tions in five GAPs for RhoA, but only one GEF specific for RhoA 
(ARHGEF12) showed a translocation with unknown functional con- 
sequences (Fig. 4a, Supplementary Tables 7, 8 and 10 for functional 
consequences). The bias for inactivation of GAPs for Rho and GEFs for 
Rac is highly significant (one-sided Fischer’s exact test: P< 0.0007). 
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Figure 3 | Structural variations in ATRX result in low mRNA expression 
levels. a, Coverage plots displaying the structural variations in the ATRX gene. 
The dots indicate summed coverage for bins of 1,000 base pairs of the tumour 
genome, normalized to the coverage in corresponding normal tissue. The 
intron-exon structure of ATRX is shown in red (dark red are exons). b, ATRX 
mRNA expression of 70 tumours as measured on Affymetrix full-exon arrays. 
Tumours with ATRX deletions are encircled. Coloured tracks below figure, 
from top to bottom: age at diagnosis (green < 1.5 year, red > 1.5 year); survival 
(red, dead; dark green, alive >5 year; light green, alive <2 year); MYCN 
amplification (red, yes; green, no); stage (light green, stage 1; dark green, stage 2; 
brown, stage 3; red, stage 4; blue, stage 4S). 
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Figure 4 | Neuroblastoma with genomic defects in neuritogenesis genes 
cluster in high-risk tumours. a, Diagram ofa neurite growth cone depicting the 
function of the proteins encoded by genes with genomic aberrations in 
neuritogenesis. Red proteins have defects (for references see Supplementary 
Table 8). Rac and Rho small GTPases cycle between an inactive GDP-bound and 
active GTP-bound conformation, transducing signals from a wide variety of 
membrane receptors. They are activated by GEFs and inactivated by GAPs. 
Guanine nucleotide dissociation inhibitors (GDIs) sequester GDP-bound 
GTPases. Proteins with aberrations in more than one tumour are marked with 
an asterisk (*). b, Diagram of genetic defects and clinical parameters of all 87 
sequenced neuroblastoma. Each vertical lane summarizes one tumour. Patients 
are sorted by the presence of genomic aberrations in neuritogenesis genes 
(Neuritogenesis, n = 19), by MYCNamplification (MYCN, n = 23), and by INSS 
stage (high to low). Clinical and molecular genetic characteristics are shown for 
each tumour as tracks; INSS stage (green, stage 1 and 2; red, stage 3 and 4; orange, 
stage 4S); Survival (red, death; green, alive), Age group (green, <1.5 year; red, 
21.5 year), ALK (red, mutated; green, wild type), Chromothripsis (red, yes; 
green, no). Middle panel, amino-acid-changing mutations and structural 
variations are indicated for all genes having two or more events and that are 
involved in neuritogenesis (red, mutated or structural variant; grey, not affected). 


This indicates that alterations in GTPase-regulating genes specifically 
function to activate Rho or inactivate Rac, which both tip the balance 
in Rac/Rho signalling towards inhibition of neuritogenesis. Of note, 
transgenic mice with ATRX mutations causing mental retardation in 
humans showed abnormal dendritic spine formation with increased 
TIAM1 phosphorylation and Racl signalling”. 

We conclude that alterations with significant frequencies (PTPRD 
and ODZ genes) affect transmembrane receptors that function in 
neuronal growth cone guidance and maintenance. In addition gene 
ontology analysis of the 1,041 genes showed significant enrichment of 
GTPase-regulating genes. Alterations in GEFs for Rac and GAPs for 
Rho significantly deviate from a random distribution, implicating 
inhibition of Racl and activation of RhoA in impairing neuritogenesis 
in neuroblastoma (Fig. 4a). 

From these findings we propose that defects in neuritogenesis- 
regulating genes form an important category of tumour-driving events 
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in neuroblastoma. For a preliminary analysis of tumours with these 
defects, we selected the genes with recurrent defects in tumours that 
function in neuronal growth cones (PTPRD, ODZ3, ODZ2, CSMD1) or 
regulation of these processes through Rac/Rho signalling (TIAM1, 
DLC1, ARHGAP10, ATRX). The 19 tumours with defects in these 
genes were almost all stage 3 and 4 tumours diagnosed above 1.5 year 
of age with an aggressive clinical course. Only few of them showed 
amplification of MYCN (Fig. 4b). Consistent with their occurrence in 
high-stage neuroblastoma, defects in neuritogenesis genes did not 
show an independent prognostic power in multivariate analysis with 
the clinical parameters age and stage. 

Here we report the first whole-genome sequence study of a com- 
prehensive series of neuroblastoma including both low- and high-stage 
tumours. Low-stage neuroblastoma lacked recurrent gene alterations 
(mutations and structural variations), raising the question whether 
they are primarily driven by chromosomal imbalances and the con- 
sequent gene dosage effects. Tumours with defects in genes function- 
ing in neuritogenesis or growth cone guidance mostly are aggressive 
high-stage tumours without MYCN amplification. Interestingly, there 
is indication that MYCN also inhibits neuritogenesis in neuroblastoma. 
MYCN downregulates the mRNA expression of the chromosome 1p36 
gene CDC42, which resulted in inhibition of neuritogenesis of neuro- 
blastoma cells**. CDC42 is a small GTPase protein with a function 
similar to Racl. Racl and CDC42 both regulate the Par3—Par6 complex 
(also known as PARD3-PARD6A) involved in cell polarization and 
growth cone development and which has been shown to drive neuro- 
blastoma cell differentiation”. In light of our data demonstrating geno- 
mic alterations in Rho/Rac signalling, it is tempting to postulate that 
the Par3—Par6 complex is a recurrent target for inactivation in neuro- 
blastoma. Intriguingly, the block in neuritogenesis of neuroblastoma is 
probably not absolute. Retinoic acid can induce neuronal outgrowth in 
many neuroblastoma cell lines, which all are derived from high-stage 
tumours. Retinoic acid is also used in long-term neuroblastoma treat- 
ment protocols to prevent recurrences”. It is currently unknown which 
tumours will clinically respond to retinoic acid therapy. Deletions of 
NF1 were previously implicated in retinoic acid resistance of neuro- 
blastoma cell lines**. The identification of genomic alterations in a 
range of genes mediating neuritogenesis now allows investigation of 
therapeutic modalities to surmount these defects. 


METHODS SUMMARY 


All neuroblastoma samples were derived from primary tumours of untreated 
patients. Tumour material was obtained during surgery and a portion was imme- 
diately frozen in liquid nitrogen. We used leukocytes derived from peripheral 
blood for isolation of constitutional DNA. High-molecular-weight DNA was 
extracted from tumour tissue and leukocytes using standard procedures. Fifteen 
microgram of DNA was subjected to paired-end whole-genome sequencing 
according to the Complete Genomics technology. Initial data analyses were per- 
formed using the CGAtools v1.3.0 package (http://cgatools.sourceforge.net/docs/ 
1.3.0/) and for subsequent analysis and figure preparation we used the R2 
bioinformatic platform (http://r2.amc.nl) and PERL scripts. More details on muta- 
tion analysis, coverage analysis, analysis of structural variants and the Circos plots 
is given in the Supplementary Information. Gene expression was assayed on 
Affymetrix EXON ST 1.0 GeneChips. SNP genotyping microarray analysis details 
are described in the supplementary information. Mutation validations were per- 
formed using Sanger sequencing (Supplementary Information). Statistical analysis 
(gene ontology enrichment using the Cytoscape BINGO plugin, DAVID func- 
tional ontology cluster analysis and analysis for gene length enrichment) are 
described in the Methods. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 

Small variants. Variant selection procedure: Potential somatic variants were 
determined with the CallDiff method with somatic output within the CGAtools 
v1.3.0 package, maintained by Complete Genomics (http://cgatools.sourceforge. 
net/docs/1.3.0/). Every tumour or cell line sample was compared to its matched 
blood sample across the whole genome. The somatic score which is calculated tries 
to tease apart true somatic mutations from false somatic mutations (http:// 
cgatools.sourceforge.net/docs/1.3.0/). The somatic output files were then filtered 
to those regions where coding sequences are defined for the UCSC refflat annotation 
(http://hgdownload.cse.ucsc.edu/goldenPath/hg18/database/refFlat.txt 2 August 
2010), thereby removing discontinued genes from the original annotation of 
NCBI36.3. Subsequently, gene symbol, amino acid change and effect categories 
were extracted from gene-GSXXX-ASM.tsv files for all genomes and added as 
annotation to the somatic output results. Those variants that could not be 
annotated (new/updated genes) were annotated with custom PERL scripts where 
possible. All variants were annotated for their presence in dbSNP130 (http:// 
hgdownload.cse.ucsc.edu/goldenPath/hg18/database/snp130.txt), as well as the 
presence within 37 public HapMap genomes released by complete genomics 
(ftp://ftp2.completegenomics.com/). SIFT (http://sift.jcviiorg/www/SIFT_chr_coords_ 
submit.html NCBI36) and polyphen2 (http://genetics.bwh.harvard.edu/pph2/ 
bgi.shtml NCBI36) scores were determined to assess potential impact for all the 
SNP variants. Variants, which are reported in dbSNP130, that were found in any of 
the normal blood samples or that were found within the public genomes from 
Complete Genomics were removed from the data set. Finally, variants which were 
found in genes that are not expressed in neuroblastoma, but do show expression in 
a series of 500 normal samples were removed (See Affymetrix expression analysis). 
Somatic small variants trim-down. CallDiff with somatic output was performed 
on all of the 87 tumour/lymphocyte pairs and processed. The number of events 
(split by deletion/insertion/substitution/SNP) were counted for the complete 
genome. Next, the somatic output files were filtered on those parts of the genome 
that are covered by amino-acid-encoding regions(based on the coding sequence 
(CDS) within the refflat file from the UCSC). In the next step, only those variants 
were kept that have an impact on the coding sequence (non-silent), do not occur in 
any of our normal samples, nor in any of the publicly available genomes from 
Complete Genomics, nor are present in dbSNP130. Finally, we filtered for the 
somatic score to be 20.1 (as determined by plotting of the Sanger sequence 
validation results as a function of somatic scores and total scores). 

Sanger sequencing. High-molecular-weight DNA was extracted from tumour 
tissue and leukocytes using standard procedures*’. Primers for PCR amplification 
were automatically designed by custom PERL scripts that execute the Primer3 
software. PCR was performed using 20 ng of genomic DNA. Sanger sequencing 
was performed on a capillary sequencer using standard procedures. 

Complete Genomics comparative genomic hybridization (cgCGH) procedure. 
First we determine the summed coverage (uniqueSequenceCoverage) in windows 
of 1,000 base pairs (measured on the reference genome) for the normal and the 
tumour sample (coverageRefScore files). Here we take the Integer(position/1,000) 
as the bin for any position and keep track of the sum. Then we determine the total 
coverage sums of the genome and normalize to this value, to remove differences in 
total coverage between samples. Subsequently we determine the log(tumour/ 
genome)/log(2) for every bin (1,000-bp window) and obtain a cgCGH profile that 
expresses the somatic changes of the respective tumour. As the profile is normalized 
to its own normal material, the most prominent sources of bias such as GC-content, 
and also per-person copy number variation characteristics are corrected for. We 
feed these results to the DNAcopy algorithm as provided in R BioConductor to 
segment the information into blocks of similar characteristics and use these seg- 
ments boundaries to store the information in an efficient way. cgCGH data was 
visualized within the embedded genome browser of R2 (http://r2.amc.nl). 

Circos plots. Comparisons of somatic structural variants between tumour and 
lymphocyte genomes were performed with the JunctionDiff and Junction2Event 
tool from CGAtools (http://cgatools.sourceforge.net/docs/1.4.0/). These somatic 
events were filtered with the following criteria: events annotated as artefacts, 
footprints smaller than 70 bases, less than 10 discordant mate pairs, under- 
represented repeats, and presence in a set of v2.0 baseline genomes (as provided 
at the website of Complete Genomics (B36baseline-junctions.tsv)). cgCGH pro- 
files and the remaining events were plotted with the Circos program (http:// 
Www.circos.ca). 

Chromothripsis. Genomes were annotated as having chromothripsis-like char- 
acteristics when the sum of intra-chromosomal somatic junctions (as reported by 
JunctionDiff and filtered as above) within a single chromosome was larger or equal 
to 20. Focused amplified regions (cgCGH scores =3) within a chromosome were 
excluded from this sum. Using these characteristics, we annotated 10 out of the 87 
patients as chromothripsis-like. Nine out of these patients were diagnosed with 
stage 4 neuroblastoma (P = 0.0392 stage4 versus rest) and all 10 were present in 


high-stage neuroblastoma (P = 0.0116). Eight of these patient have died of the 
disease (P = 0.0413; log-rank P= 7.1 X 10°°). 

Affymetrix expression analysis (expressed genes). To assess whether genes con- 
taining variants are expressed in neuroblastoma, we make use of a panel of neuro- 
blastoma tumours (also including 53 of the sequenced tumours), classical 
neuroblastoma cell lines as well as recently generated patient derived cell lines 
(n = 119 in total).All samples were derived from primary tumours of untreated 
patients. Material was obtained during surgery and immediately frozen in liquid 
nitrogen. The original sources for classical neuroblastoma cell lines can be found in 
ref, 32. Total RNA of neuroblastoma samples was extracted using TRIzol reagent 
(Invitrogen) according to the manufacturer's protocol. RNA concentration and 
quality were determined using the RNA 6000 nano assay on the Agilent 2100 
Bioanalyzer (Agilent Technologies). Fragmentation of complementary RNA, 
hybridization to hg-u133 plus 2.0 microarrays and scanning were carried out 
according to the manufacturers protocol (Affymetrix). The data were deposited 
in the NCBI Gene Expression Omnibus (http://www.ncbi.nlm.nih.gov/geo/) 
under accession number GSE16476. 

Affymetrix expression data from adult tumours were derived from the 
Expression Project for Oncology (ExpO) database from the International 
Genomics Consortium (IGC) (http://www. intgen.org/expo.cfm). Expression data 
on normal tissues was downloaded from GEO (GSE7307). The expression data 
were normalized with the MAS5.0 algorithm within the GCOS program of 
Affymetrix. Target intensity was set to 100. If more than one probe set was 
available for one gene the probe set with the highest expression was selected, 
considered that the probe set was correctly located on the gene of interest. For 
101 patients (60 of the sequenced tumours), we have also generated Affymetrix 
Exon array data. The z-score of expression within this data set was used as a 
independent validation of structural variants annotated as deletions, where 
possible. All data were analysed using our in-house-developed R2 web application, 
which is freely accessible at http://r2.amc.nl. 

SNP array analysis. SNP arrays were processed according to the manufacturer’s 
recommendations with the Infinium II assay on Human370- and Human660- 
quad arrays containing >370.000 and >660,000 markers, respectively, and run on 
the Illumina BeadStation (Swegene Centre for Integrative Biology, Lund 
University — SCIBLU, Sweden) according to the manufacturer’s recommenda- 
tions. Raw data were processed using Illumina’s BeadStudio software suite 
(Genotyping module 3.0), producing report files containing normalized intensity 
data and SNP genotypes. Subsequently, log, ratio and B-allele frequency data were 
imported into the R2 web application for detailed analysis and comparison with 
the CGH and expression data. 

Selection procedure somatic small variants table. We ran CallDiff with somatic 
output on all of the 87 tumour/lymphocyte pairs and processed the somatic output 
files as described in the small variants section earlier. Due to the low validation 
percentages of substitutions and insertions, these were removed from the table. 
Variants which were tested by Sanger sequencing and were all-reference or all- 
variant were removed from the table. Variants with somatic scores lower than 0.1 
or of types insertion/substitution, but which were validated by Sanger sequencing, 
were maintained in the table. In addition, we determined the presence of somatic 
splice-site variants in the two bases surrounding exons as defined by the UCSC 
refflat data. 

Selection procedure somatic structural variant table. Comparisons between 
tumour and lymphocyte genomes were performed with the JunctionDiff and 
Junction2Event tool from CGAtools. These somatic events were filtered with 
the following criteria: events annotated as artefacts, footprints smaller than 70 
bases, less than 10 discordant mate pairs, under-represented repeats, and presence 
in a set of baseline genomes (as provided at the website of Complete Genomics 
(B36baseline-junctions.tsv)). Of the remaining entries, we kept the following 
events: exon_bites where both ends of a junction are within the same gene, and 
in addition affect exonic sequence. Breaks by inversion, where both ends of a 
junction land within a gene, thereby damaging both genes, but leaving the genes 
in between unaffected. Potential fusion genes which are strand-matched, where 
both ends of a junction land within a gene, and the resulting end product fits in 
terms of orientation of both genes. Regions (deletions/(tandem) duplications) of up 
to 1 megabase, containing up to five genes which are expressed in neuroblastoma. 
Combination of small, splice and structural somatic variants table. The small 
variant, structural variant and splice-site somatic tables were merged on the gene 
symbol, and unique tumour IDs were counted to obtain a final list of recurrent 
gene affecting variants. 

Kaplan scan. Kaplan scanning was performed within R2 (http://r2.amc.nl). In 
short, for each gene or other numerical characteristic, R2 calculates the optimal 
cut-off expression level dividing the patients in a good and bad prognosis cohort. 
Samples within a data set are sorted according to the expression of the investigated 
gene and divided into two groups on the basis of a cut-off expression value. All 
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cut-off expression levels and their resulting groups are analysed for survival, with 
the provision that minimal group number is 8 (or any other user-defined value) 
samples. For each cut-off level and grouping, the log-rank (as described in ref. 33) 
significance of the projected survival is calculated. The best P value and corres- 
ponding cut-off value is selected. This cut-off level is reported and used to generate 
a Kaplan-Meier graph. The graph depicts the log-rank significance (‘raw P’), as 
well as a P value corrected for the multiple testing (Bonferroni correction) of cut- 
off levels for each gene. 

Statistical analysis for gene ontology gene enrichment. To investigate the pos- 
sible role of the mutated genes we used the BinGO plugin® for the network 
visualization tool Cytoscape*’. This tool assesses enrichment of gene ontology 
categories for a set of genes. The P-value assigned to the overrepresentation for 
a specific category is calculated through a hypergeometric test. Results are cor- 
rected for multiple testing. We used the set of 1,041 genes having a mutation or 
structural variation in one or more neuroblastoma tumours. In the more inform- 
ative gene ontology categories (filter set to less than 500 genes per category); the 
‘regulation of GTPase activity gene ontology category seemed to be the most 
significant. The DAVID online functional clustering tool’* confirmed this analysis 
for the biological process gene ontology branch; the third cluster in the list con- 
tained the GTPase regulation category with an enrichment score of 1.44. 
Statistical significance of genes affected by amino-acid-changing mutations 
and structural variations. We analysed the likelihood of finding defects in 
specific genes, or of finding defects in equal to/more than the number of 
affected patients. We therefore used the following randomization strategy: we 
started by counting the number of genes affected by structural variations, for every 
sample in our data set. Next, the genomic footprints (bases from start to the end of 
the RefSeq on the HG18 genome (UCSC refflat August 2010)) were determined 
and merged to the largest possible span for every gene symbol. These stretches 
were fused to create an artificial sequence, with the length of all gene symbols 
combined. Within this artificial sequence, random nucleotide numbers were 
chosen and traced back to the corresponding gene symbols. This was done for 
each tumour sample separately, with the number of randomly selected nucleotides 
being identical to the number of structural defects identified in the actual 
(sequenced) tumour. An identical strategy was used for the CDS affecting 
amino-acid-changing mutations, where the length of the largest CDS (NCBI 
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RefSeq data set August 2010) per gene symbol has been used, to generate the 
artificial sequence. 

In this way, we could generate artificial data sets, that most closely reflect the 

complexity of our sequenced sample set, including the spread in events, as well as 
the genomic footprints (size)/lengths of the CDS of all the different genes within 
the genome. We generated 109,000 artificial data sets, after which we created 
histograms to assess the null distributions for all genes, as well as the likelihood 
of finding combinations of recurrent events. The chance of finding any combina- 
tion of three or more genes affected in five or more patients within our data set was 
0.000211. Of note, this is a conservative estimation, as our real data set also 
included four frequently affected genes in the MYCN amplicon. The chance of 
affecting seven genes in five or more patients is much lower than 10” °. In addition, 
the chances of finding ATRX, ODZ3 or PTPRD in five or more patients was 
<10-°, 6.42 X 10° and 0.00534, respectively. 
Statistical analysis for Rac/Rho GEF/GAP specificity. To ascertain whether the 
ratio of mutations/structural variations in Rac-specific GEFs and Rho-specific 
GAPs was significantly different from expectance, we performed Fisher’s exact 
test on the number of mutations in GAP and GEF with specificity for only Rac or 
Rho in our data set (one-sided Fisher’s exact test, P = 0.0007). We also pooled 
events in Rac-specific GEFs and Rho-specific GAPs together and tested whether 
they possessed more events than the pool of Rac-specific GAPs and Rho-specific 
GEFs (one-sided Fisher’s exact test, P = 0.026). 


31. Molenaar, J. J., van Sluis, P., Boon, K., Versteeg, R. & Caron, H. N. Rearrangements 
and increased expression of cyclin D1 (CCND1) in neuroblastoma. Genes 
Chromosom. Cancer 36, 242-249 (2003). 

32. Thiele, C. J. In Human Cell Culture vol. 1 (ed. Masters, J.) 21-53 (1998). 

33. Bewick, V., Cheek, L. & Ball, J. Statistics review 12: survival analysis. Crit. Care 8, 
389-394 (2004). 

34. Maere, S., Heymans, K. & Kuiper, M. BINGO: a Cytoscape plugin to assess 
overrepresentation of gene ontology categories in biological networks. 
Bioinformatics 21, 3448-3449 (2005). 

35. Smoot, M. E., Ono, K., Ruscheinski, J., Wang, P.-L. & Ideker, T. Cytoscape 2.8: new 
features for data integration and network visualization. Bioinformatics 27, 
431-432 (2011). 

36. Huang, D. W., Sherman, B. T. & Lempicki, R. A. Systematic and integrative analysis 
of large gene lists using DAVID bioinformatics resources. Nature Protocols 4, 44-57 
(2008). 


©2012 Macmillan Publishers Limited. All rights reserved 


Lait Ieee. 


doi:10.1038/nature10911 


The mechanism of OTUB1-mediated inhibition 


of ubiquitination 


Reuven Wiener’, Xiangbin Zhang’, Tao Wang'} & Cynthia Wolberger' 


Histones are ubiquitinated in response to DNA double-strand breaks 
(DSB), promoting recruitment of repair proteins to chromatin’. 
UBC13 (also known as UBE2N) is a ubiquitin-conjugating enzyme 
(E2) that heterodimerizes with UEV1 A? (also known as UBE2V1) and 
synthesizes K63-linked polyubiquitin (K63Ub) chains at DSB sites in 
concert with the ubiquitin ligase (E3), RNF168 (ref. 3). K63Ub syn- 
thesis is regulated in a non-canonical manner by the deubiquitinating 
enzyme, OTUB1 (OTU domain-containing ubiquitin aldehyde- 
binding protein 1), which binds preferentially to the UBC13~Ub 
thiolester*. Residues amino-terminal to the OTU domain, which 
had been implicated in ubiquitin binding’, are required for binding 
to UBC13~Ub and inhibition of K63Ub synthesis’. Here we describe 
structural and biochemical studies elucidating how OTUB1 inhibits 
UBC13 and other E2 enzymes. We unexpectedly find that OTUB1 
binding to UBC13~Ub is allosterically regulated by free ubiquitin, 
which binds to a second site in OTUB1 and increases its affinity for 
UBC13~Ub, while at the same time disrupting interactions with 
UEVI1A in a manner that depends on the OTUB1 N terminus. 
Crystal structures of an OTUB1-UBC13 complex and of OTUB1 
bound to ubiquitin aldehyde and a chemical UBC13~Ub conjugate 
show that binding of free ubiquitin to OTUB1 triggers conforma- 
tional changes in the OTU domain and formation of a ubiquitin- 
binding helix in the N terminus, thus promoting binding of the 
conjugated donor ubiquitin in UBC13~Ub to OTUB1. The donor 
ubiquitin thus cannot interact with the E2 enzyme, which has been 
shown to be important for ubiquitin transfer®’. The N-terminal helix 
of OTUB1 is positioned to interfere with UEV1A binding to UBC13, 
as well as with attack on the thiolester by an acceptor ubiquitin, 
thereby inhibiting K63Ub synthesis. OTUB1 binding also occludes 
the RING E3 binding site on UBC13, thus providing a further 
component of inhibition. The general features of the inhibition 
mechanism explain how OTUB1 inhibits other E2 enzymes* in a 
non-catalytic manner. 

OTUBI was previously identified as a K48 linkage-specific deubiqui- 
tinating enzyme that contains two distinct ubiquitin-binding sites 
(Fig. la): a distal site and a proximal site that includes the ~45 
N-terminal residues of OTUB1 (ref. 5). These residues are important 
for OTUB1 inhibition of E2 activity* and are absent in OTUB2, which 
does not inhibit UBC13 (ref. 4). It was previously shown that binding 
of the covalent inhibitor, ubiquitin aldehyde (Ubal), to the distal 
ubiquitin-binding site of OTUB1 stimulates binding of ubiquitin vinyl 
sulfone to the N terminus’. Because the OTUB1 N terminus was 
implicated in binding to the donor ubiquitin in the UBC13~Ub con- 
jugate*, we asked whether Ubal binding to OTUB1 could enhance 
inhibition of UBC13 by stimulating binding of the OTUB1 N terminus 
to the donor ubiquitin. The results (Fig. 1b) showed a marked enhance- 
ment of the ability of OTUB1 to suppress K63Ub synthesis, indicating 
that Ubal is an allosteric effector that increases the affinity of the 
OTUBI N terminus for the ubiquitin in the UBC13~Ub thiolester. 
This prompted us to ask whether free ubiquitin binding to the OTUB1 
distal site could similarly stimulate binding of OTUB1 to UBC13~Ub 


conjugates. To test this, we generated a mixture of charged and 
uncharged UBC13(C87S), which forms a more stable UBC13~Ub 
oxyester, purified away the free ubiquitin, and performed pull-down 
assays with H.-OTUBI in the presence and absence of added free 
ubiquitin. Remarkably, OTUB1 shows no preference for the charged 
UBC13~Ub in the absence of ubiquitin, whereas addition of 100 LUM 
free ubiquitin greatly enhances OTUB1 binding to UBC13~Ub, but 
not to uncharged UBC13 (Fig. 1c). By contrast, ubiquitin bearing 
hydrophobic patch mutations 144A, L8A or L8A/I44A/R42A (but 
not R42A alone) do not stimulate OTUB1 binding to UBC13~ Ub like 
wild-type ubiquitin (Fig. 1c). The relative binding of OTUB1 to 
UBC13~Ub increases as the concentration of free ubiquitin is 
increased from 2 to 50M (Supplementary Fig. 2). To verify that 
ubiquitin binding to the distal site of OTUB1 is important for inhibi- 
tion of UBC13, we assayed the effect of distal site mutations, which 
were chosen based on structures of a covalent yeast Otul-ubiquitin 
complex® and of human OTUBI (ref. 9). Distal site substitutions 
F193W, F193R and H217W disrupted the ability of OTUB1 to inhibit 
polyubiquitination by UBC13-UEVI1A (Fig. 1d) without affecting 
binding of OTUB1 to UBC13 (Supplementary Fig. 3). Taken together, 
our results indicate that the ability of OTUB1 to bind preferentially to 
the UBC13~Ub conjugate and inhibit ubiquitin transfer is allosteri- 
cally regulated by free ubiquitin binding to the distal site of OTUB1 
(Fig. la), which triggers capture of the conjugated ubiquitin in the 
OTUBI proximal site. 

Because ubiquitin aldehyde most probably enhances interactions 
between the OTUB1 N terminus and the donor ubiquitin in 
UBC13~Ub, we examined the effect of N-terminal deletions in 
OTUB1 to delimit the minimal fragment needed for binding and 
inhibition. Deletion of residues 1-15 has no effect on inhibition of 
K63Ub synthesis by UBC13-UEV1A (Fig. le) whereas deletion of 30, 
37 or 41 residues significantly disrupts inhibition. The OTUB1A15 
deletion similarly behaves like full-length OTUB1 in pull-downs with 
the UBC13~Ub conjugate whereas larger deletions exhibit defects 
(Supplementary Fig. 4), indicating that N-terminal residues 16-45 
are sufficient for activity. 

Because a UEV (ubiquitin E2 variant) must bind to UBC13 and 
position the acceptor ubiquitin for K63Ub synthesis to occur'®, we 
asked whether OTUB1 could bind to UBC13 in the presence of 
UEVIA. In gel filtration assays using fluorescently labelled UEV1A, 
OTUB1 and uncharged UBC13 migrate as a ternary complex with 
UEVIA (Fig. 1f). To assay binding to charged UBC13, we generated 
a non-hydrolysable conjugate in which Ub with a carboxy-terminal 
G75C is covalently linked to the active-site cysteine of UBC13 with 
dichloroacetone (DCA)!!. UEV1A binds to UBC13?“4~Ub but 
OTUB1-Ubal interferes with UEV1A binding to the UBC13°“*~Ub 
conjugate (Fig. 1g). By contrast, the N-terminal deletion, OTUB1A37, 
can still form a complex with UBC13°©“~ Ub and labelled UEV1 in the 
presence of Ubal (Fig. 1h), indicating that the N terminus of OTUB1 
competes with UEV binding when OTUBI is bound to Ubal. We 
verified that free ubiquitin has a similar effect on UEV binding by 
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Figure 1 | Allosteric regulation of OTUB1 by ubiquitin. a, Schematic 
diagram of OTUB1 illustrating proximal and distal ubiquitin binding sites. 

b, Effect of ubiquitin aldehyde (Ubal) on the ability of human OTUB1 to inhibit 
K63 polyubiquitin synthesis by UBC13-UEV1A. Assays include 0.1 1M E1, 
0.4 uM UBC13-UEVI1A, 0.5 1M human OTUB1, 5 uM ubiquitin. The 3 h time 
point is shown in the presence (right) and absence (left) of human OTUB1, 
without (—) and with (+) 0.5 uM Ubal. Top shows detection by anti-Ub 
western blot; Coomassie staining below shows level of human OTUBL. ¢, Pull- 
down assay showing binding of H,-tagged human OTUBI to a mixture of 
UBC13 and UBC13~Ub oxyester in the presence and absence of 100 iM free 
ubiquitin (wild type (WT) or mutant). d, Effect of human OTUB1 distal site 
mutations on inhibition of K63Ub synthesis. Assay performed as in b but with 


comparing migration of a sample containing labelled UEV1, 
UBC13?“*~Ub and OTUBI prepared in the presence and absence 
of free ubiquitin and found that the ratio of free UEV1 to UEV1- 
UBC13°“*~Ub-OTUB1 increases when ubiquitin is present 
(Fig. 1i). Similarly, pull-downs with He-OTUB1 do not show an 
enhancement in coprecipitation of UEV1A along with UBC13~Ub 
in the presence of added free ubiquitin (Supplementary Fig. 5). These 
results indicate that the N terminus of OTUB1 interferes with UEV 
binding and thus with K63Ub synthesis, and that the ability of the N 
terminus to interfere with UEV depends upon a conformational change 
that is triggered by binding of free ubiquitin to OTUB1. 

To determine the structural basis for OTUB1 inhibition of E2 
enzymes, and how ubiquitin allosterically regulates OTUB1 activity, 
we determined the structure of Caenorhabditis elegans OTUB1 (worm 
OTUB1) bound to human UBC13 at 1.8 A resolution (Fig. 2a), and a 
2.35A resolution quaternary complex structure containing worm 
OTUBI1, Ubal and a UBC13°“~Ub conjugate generated with 
Ub(G76C). The resulting non-native linkage is four bond lengths longer 
than the native thiolester (Supplementary Fig. 6). Human UBC13 is 
89% identical to worm UBC13, whereas human OTUB1 shares 34% 
sequence identity and 56% similarity with worm OTUB1 (Supplemen- 
tary Fig. 7) and inhibits K63Ub chain formation by human UBC13- 
UEVI1A (Supplementary Fig. 8a). Worm OTUB1 is a weaker inhibitor 
of UBC13, as reflected in its higher Ky of 58.5 LM compared to 7.04 UM 
for human OTUB1 (Supplementary Fig. 8b). Crystals of the worm 
OTUB1-Ubal-UBC13°“"~Ub complex contain four complexes in 
the P2,2,2) asymmetric unit. The ubiquitin conjugated to UBC13 could 
be unambiguously positioned in two of the four complexes (Sup- 
plementary Fig. 9); our discussion focuses on the complex with the 
most well-ordered ubiquitin (complex 1). Because the N terminus of 
OTUB1 that plays a key role in inhibition is poorly conserved between 
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1uM OTUBL. e, Effect of human OTUBI1 N-terminal deletions of 15, 30, 37 
and 42 residues on inhibition of K63Ub synthesis by UBC13-UEV1A. Assay 
performed as in d. f, Gel filtration showing complex formation between 
fluorescein-labelled UEV1A (UEV1A*), UBC13 and human OTUB1. Signal 
due to UEV1A only was monitored at 495 nm. g, Experiment performed as in 
f showing binding to UEV1A* by UBC13°“*~Ub(G75C) in the absence (red) 
and presence (green) of OTUB1-Ubal. h, Experiment performed as in f but 
with human OTUB1A37. The position at which free UEV1* migrates is 
indicated. i, Experiment performed as in f with fluorescein-labelled UEV mixed 
with UBC13?©4~Ub(G75C) and OTUB1 samples prepared in the presence 
and absence of 200 1M ubiquitin. The position at which free UEV1* migrates is 
indicated. 


human and worm OTUBI, we also determined the 3.1 A resolution 
structure ofa quaternary complex with a hybrid OTUB1 containing the 
N-terminal 45 residues of human OTUB1 and the OTU domain of 
worm OTUB1 (Supplementary Fig. 7b). The hybrid human/worm 
OTUB1 inhibits K63Ub synthesis by UBC13-UEV1A (Supplemen- 
tary Fig. 10). Details on all structure determinations are in Supplemen- 
tary Methods and statistics are in Supplementary Table 1. 

In the structure of apo worm OTUB1 bound to UBC13 (Fig. 2a), the 
OTU domain of worm OTUB1 binds to UBC13 in an orientation that 
places their respective active-site cysteines 28 A apart on the same face 
of the complex, burying 1,280 A? of total surface area. Of the 12 worm 
OTUBI side chains at the interface with UBC13 (Fig. 2b), seven are 
identical in human OTUB1 and four are similar (Supplementary Fig. 7a) 
and can mediate comparable interactions with UBC13. Consistent 
with this, the double substitution Y170A/F138A in human OTUB1 
(Y168A/F135A in worm OTUB1) is defective in binding to UBC13 
(Supplementary Fig. 11). Similar interactions could form between 
OTUB1 and UBE2D2 (also known as UBCHS5B) (Fig. 2c), but clashes 
due to an insertion and a non-conserved lysine would arise with 
UBE2L3 (also known as UBCH7), consistent with the observation that 
OTUB1 inhibits UBCH5B but not UBCH7 (ref. 4). 

An overview of the human/worm OTUB1-Ubal-UBC13°™*~Ub 
complex is shown in Fig. 2d, e. Ubal binds to the OTUB1 distal site 
while the donor ubiquitin in the UBC13~Ub conjugate binds in the 
OTUB1 proximal site, which comprises residues in both the OTU 
domain and the N terminus. In the absence of bound ubiquitin, the 
worm OTUBI1 N terminus (residues 1-37, corresponding to human 
OTUB1 residues 1-39) is disordered (Fig. 2a). However, in the 
OTUB1-Ubal-UBC13"“*~Ub complexes, part of the N terminus 
of OTUB1 becomes ordered, forming a ubiquitin-binding helix that 
contacts the donor ubiquitin in the distal site (Fig. 2e). Additional 
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Figure 2 | Structure OTUB1-UBC13 and OTUB1-Ubal-UBC13°“*~Ub. 
a, Complex of worm OTUBI (green) bound to human UBC13 (blue). 
Respective active-site cysteines are shown as space-filling representations. 
Dashed line indicates disordered residues. b, Contacts at worm OTUB1 
(green)-UBC13 (blue) interface. c, Superposition of UBCH5B (UBE2D2, PDB 
ID 2ESK) and UBCH7 (UBE2L3, PDB ID 1FBV) with UBC13 in the complex 
with worm OTUB1. UBCH7 contains an insertion (at N94) and a lysine (K96) 
that would interfere with binding. d, Structure of hybrid human/worm OTUB1 
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Figure 3 | Conformational changes in the OTU domain triggered by Ubal 
binding. a, Superposition of worm OTUB1 (green) bound to Ubal (yellow 
surface) with the structure of apo worm OTUB1 (grey). Dotted circles indicate 
regions of conformational change, which are illustrated in the figure panels 
noted. b, Location of human OTUB1 distal site mutations that affect inhibition. 
The structure of human OTUB1 (2ZFY; brown) is superimposed on worm 
OTUBI (green)-Ubal (yellow). Ubiquitin residues L8 and 144, where 
substitutions with alanine disrupt allosteric effect of ubiquitin binding, are 
shown. View is 180° rotation about vertical compared with a. c, Structural 
differences in the OTU domain in the presence (green) and absence (grey) of 


Human OTUB1 - apo (2ZFY) 
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(green) bound to Ubal (distal Ub, yellow), UBC13 (blue) and ubiquitin 
(proximal Ub, red) that is covalently linked to the active-site cysteine (C87) of 
UBC13 by a DCA linkage. Dashed line indicates disordered C-terminal 
residues 73-76 of the donor ubiquitin and DCA linkage. e, A 90° rotation 
compared to d showing positions of worm OTUB1 and UBC13 active-site 
cysteine and modelled location of K48 of the proximal ubiquitin. f, Contacts 
between the donor ubiquitin (red) and the OTU domain (green) in the worm 
OTUB1-Ubal-UBC13?“*~Ub complex. 


d 


c 
Distal Ub 


Worm OTUB1 
Worm OTUB1 (apo) 


(proximal) 


distal Ub that affect contacts with the donor Ub. Arrows indicate 
conformational changes. Dotted lines indicate hydrogen bonds and salt 
bridges. View shown is from ‘top’ of complex as shown on right of panel 

a, rotated 90° counter-clockwise. d, Effect of mutating OTUB1 conserved 
arginine, worm OTUB1(R236E) and human OTUB1(R238E), on inhibition of 
UBC13-UEVIA. Assay performed as in Fig. 1b, with 1 1M human OTUB1 and 
15 uM worm OTUB1. e, View of OTU domain structural rearrangements 
coloured as in c. View as in panel a; proximal ubiquitin not shown. f, Detailed 
view of catalytic triad in the presence and absence of Ubal (carbon coloured as 
inc). 
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contacts with the donor ubiquitin are mediated by the OTU domain 
which, as described below, undergoes a set of conformational changes 
triggered by Ubal binding to the distal site. 

The donor ubiquitin binds to the proximal site of OTUB1 (Fig. 2d) in 
an orientation that places K48 of the ubiquitin near the OTUB1 active 
site (Fig. 2e). A K48 isopeptide linkage can be modelled between the 
proximal and distal ubiquitins, consistent with OTUB1 isopeptidase 
specificity for K48-linked diubiquitin’. Residues 73-76 of the donor 
ubiquitin and the DCA linkage are not visible in the electron density 
map, indicating that they do not adopt a unique conformation in the 
crystal. The distance between the C-terminal ubiquitin residue and the 
active-site cysteine is approximately 12.5 A, which is sufficient to accom- 
modate the four missing residues and a native thiolester linkage. The 
donor ubiquitin interface with the OTU domain buries 850 A? of surface 
area. Ubiquitin side chains that lie between residues 54-60 contact the 
OTU domain, forming both direct and water-mediated hydrogen bonds 
and van der Waals interactions (Fig. 2f). Three of the contacting worm 
OTUBI side chains are R236, Y233 and D235, which are only in a 
position to contact ubiquitin in the quaternary complex. 

The observed contacts between the donor ubiquitin and the OTU 
domain depend upon distal site binding of Ubal, which forms a covalent 
bond with the active-site cysteine (Supplementary Fig. 12) and triggers 
conformational changes in three regions of the globular OTU domain 
(Fig. 3a). Ubal binds to the distal ubiquitin binding site of OTUB1 
(Fig. 3b) in a manner similar to yeast* and viral'*'* OTU enzymes, 
and accounts for the effects of mutations in the OTUB1 distal site 
(Fig. 1d). A loop (residues 235-245) that partially occludes the distal 
site in the absence of ubiquitin undergoes a large rearrangement that 
relieves steric clashes with the distal ubiquitin and positions R236 of 
worm OTUB1 to make multiple contacts with the donor ubiquitin 
bound in the proximal site of OTUB1 (Fig. 3c). In the structure of 
apo human OTUBI (ref. 9), this residue is disordered (backbone and 
side-chain atoms) and lies in a loop that presumably changes conforma- 
tion upon distal ubiquitin binding. Mutating the conserved arginine to 
glutamic acid in both human (R238E) and worm (R236E) OTUB1 
disrupts inhibition (Fig. 3d), consistent with its role in binding the donor 
ubiquitin. Interestingly, the corresponding residue is a glutamic acid in 
OTUB2, which lacks an N-terminal arm and does not inhibit UBC13 
(ref. 4). Y233, which occludes the distal site in apo worm OTUB1 and 
undergoes a conformational change to hydrogen bond with the distal 
Ub (Fig. 3c), is conserved in human OTUB1 (Supplementary Fig. 7a). 
Another set of conformational changes in the loop connecting helices 1 
and 2 of OTUBI flips the solvent exposed Y57 side chain into the 
interior of the OTU domain, where it stacks between F65 and E56 
(Fig. 3e). The altered loop conformation relieves steric clashes with 
the donor ubiquitin that would otherwise occur. Binding of the distal 
ubiquitin is accompanied by additional local rearrangements that 
narrow the binding cleft around the ubiquitin C-terminal tail (Fig. 3e) 
and moves the worm OTUB] active-site histidine, H267, into a position 
between D269 and C88 to activate the cysteine for catalysis (Fig. 3f). 

The OTUBI1 N-terminal ubiquitin-binding helix seen in the struc- 
ture spans residues 28-39 of worm OTUB1 {complex 1) and 25-44 of 
human OTUBI (Figs 4a-c), burying 542 A? and 626 A’, respectively, 
on the donor ubiquitin (electron density shown in Supplementary 
Fig. 13). The helix interacts with the donor ubiquitin in a manner 
reminiscent of the RAP80 UIM" (Fig. 4d). Despite limited sequence 
identity between the worm OTUB1 and human OTUBI N terminus 
(Fig. 4a), the three side chains that contact the donor ubiquitin in the 
2.35 A resolution structure of worm OTUB1 (Fig. 4b) are conserved in 
human OTUBI1 (Fig. 4a) and are oriented towards ubiquitin in the 
same manner in the 3.1 A resolution human/worm OTUBI structure 
(Fig. 4c). In the worm OTUB1 complex (Fig. 4b), residues E37 and 134 
contact donor ubiquitin residue H68 while Q33 interacts with back- 
bone atoms. In the structure containing the human N terminus, the 
helix extends beyond the donor ubiquitin and approaches the UBC13 
active-site cysteine (Fig. 4c). It is possible that additional residues may 
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be ordered when the complex is in solution, as nine residues from the 
minimal human OTUB1A15 fragment that exhibits full activity 
(Fig. 1d) are missing from the human/worm OTUB1 complex struc- 
ture. It is not clear whether the shorter helix observed in the worm 
OTUBI1 complex reflects a structural difference in solution, or whether 
crystal contacts interfere with helix formation. The close approach of 
the OTUB1 N terminus to the donor ubiquitin C terminus in both 
complexes (Figs 4b, c) leaves open the possibility that additional 
contacts may form with the donor ubiquitin tail linked to UBC13 
via a native thiolester. 

The structures show how OTUB1 interferes with UEV binding and 
positioning of the acceptor ubiquitin, and also occludes the RING E3 
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Figure 4 | OTUB1 N-terminal arm and the mechanism of E2 inhibition. 

a, Sequence alignment of N-terminal arms of human OTUB1 and worm 
OTUBI1. Boxed residues form a helix in the quaternary complex structures 
containing Ubal and UBC13?°4~Ub; additional shaded residues in worm 
OTUB1 are ordered in complex 1 but are not helical. b, Donor Ub (red) 
interactions with the worm OTUB1 N-terminal helix (green); UBC13 shown in 
blue. Dashed lines indicate disordered residues. c, Interactions with the human 
OTUB1 N-terminal helix of the human/worm OTUB1 hybrid, depicted as in 
b. d, Superposition comparing RAP80 (grey, PDB ID 3A1Q) binding to 
ubiquitin (red) with human OTUB1 N-terminal helix (green). Two views are 
shown. e, Superposition of human/worm OTUB1-Ubal-UBC13?*~ Ub with 
UBC13-UEV1 (1J7D) showing predicted position of UEV1 (grey). The 
solvent-accessible surface of the human N-terminal arm residues of OTUB1 is 
depicted. f, Modelled position of attacking K63 in acceptor Ub (cyan) based on 
yeast Ubc13~Ub-Mms2 (2GMDI). g, Superposition with quaternary complex 
showing relative position of the TRAF6 E3 ligase (3HCT). 
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binding site. Figure 4e shows a superposition with the structure of a 
UBC13-UEV1 complex"* showing that the N-terminal helix of human 
OTUBI clashes with the expected location of UEV1. Modelling of the 
predicted position of the acceptor ubiquitin based on the structure of 
yeast UBC13~Ub-Mms2 (ref. 17) shows the N terminus of OTUB1 ina 
position to interfere with attack by the acceptor ubiquitin lysine on the 
thiolester (Fig. 4f). Because OTUB1 also inhibits UBCH5B*, which does 
not function with a UEV, we propose that the OTUB1 N terminus may 
also interfere with acceptor ubiquitin binding for other E2s. The re- 
positioning of the donor ubiquitin away from the E2 also probably 
contributes to inhibition, in light of evidence that the donor ubiquitin 
in the E2-Ub thiolester interacts specifically with the E2 (refs 18, 19) and 
that this is essential for ubiquitin transfer®’. In addition, superposition 
with the structure of UBC13 bound to TRAF6 (ref. 20) shows that the 
OTUBI binding site overlaps with the E3 RING-binding site (Fig. 4g), 
indicating that competition between OTUB1 and RNF168 would 
further suppress UBC13 activity in vivo. Competition with E3 binding 
is likely to be particularly important for OTUB1 inhibition of UBCH5B 
which, unlike UBC13, is strictly dependent upon an E3 ligase for activity. 

The ability of OTUB1 to serve as both an isopeptidase and an 
inhibitor of E2 enzyme activity arises from its ability to bind to selected 
E2s, while taking advantage of the allosteric communication between 
the proximal and distal ubiquitin binding sites of OTUB1 and the 
distinctive features of its N terminus. Given the high degree of coupling 
between the multiple binding interactions within the OTUB1-Ub- 
UBC13~Ub complex, the degree of inhibition in vivo will clearly 
depend upon the relative concentrations of OTUB1, E2~ Ub thiolester, 
E3 and free ubiquitin in the cell. An interesting question is whether the 
dependence of OTUBI repression on ubiquitin binding to the distal 
site is exploited to modulate OTUB1 activity in response to fluctua- 
tions in the concentration of free ubiquitin or of free chains, whose 
C-terminal subunits could similarly bind to the distal site of OTUB1. 
Our findings establish new directions for investigating how the 
allosteric regulation of OTUB1 may be exploited to regulate ubiquiti- 
nation in the DNA damage response. 


METHODS SUMMARY 


Cloning, expression, protein purification and crystallization are described in 
Methods and in accompanying references’. The DCA linkage between the active- 
site cysteine of UBC13 and a C-terminal cysteine in Ub(G75C) or Ub(G76C) was 
generated by a modification of the published method". The hybrid human/worm 
OTUBI protein contains residues 1-45 of human OTUB1 and residues 43-276 of 
worm OTUBL. Structures were determined by molecular replacement as described 
in Methods. Free ubiquitin chain synthesis was assayed by gel electrophoresis and 
products were detected by western blot with anti-ubiquitin antibody or by 
Coomassie staining. Pull-down assays were performed with purified recombinant 
protein. Assays of complex formation between OTUB1, UBC13, UBC13?“~Ub 
and UEVIA were performed by gel filtration with fluorescein-labelled UEV1A or 
UEV1, monitoring fluorescein absorbance at 495 nm. Binding of OTUB1 to UBC13 
was measured by fluorescence anisotropy using fluorescently labelled UBC13, and 
equilibrium dissociation constants were calculated using SigmaPlot (SPSS). 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 


Cloning and mutagenesis. Cloning of human and C. elegans OTUB1 (human 
OTUB1 and worm OTUBI, respectively) was performed as described previously’. 
The human UBC13 open reading frame was amplified from a human comple- 
mentary DNA library and cloned into a pET vector containing an N-terminal 
Hisg-SUMO-2 tag (pETSUMO-2) The human UEVIA ORF was synthesized 
(Integrated DNA Technologies) and subcloned into the pETSUMO-2 vector as 
above. The human UEV1 (missing the first 30 residues of UEV1A) expression 
plasmid was purchased from Addgene (http://www.addgene.org) 

Mutants ofhuman OTUB1 were generated by site-directed mutagenesis using the 

QuikChange mutagenesis kit (Stratagene) following the manufacturer’s protocol. 
The hybrid human/worm OTUBI was generated by swapping the first 41 residues 
of worm OTUBI with the first 45 residues of human OTUB1 using Infusion ligase- 
free cloning (Clontech). Human OTUB1 with an N-terminal 41-residue truncation 
(OTUB1AN41) was generated as previously described’, all other OTUB1 deletions 
were generated using Infusion ligase-free cloning (Clontech). 
Protein expression and purification. All proteins were expressed in Escherichia 
coli Rosetta-2 (DE3) cells grown in Luria-Bertani (LB) medium. Cultures were 
inoculated using 1% (v/v) overnight saturated cultures and were grown at 37 °C to 
an OD6goo of 0.8. Proteins were induced at 16°C overnight by addition of 1 mM 
isopropyl-B-D-thio-galactoside (IPTG). Cells were harvested by centrifugation 
(8,000g, 15 min) and either lysed immediately or stored at —80 °C for later use. 

Human OTUBI, worm OTUBI, human E1 enzyme and ubiquitin were purified 
as previously described**'. Deletions and mutants of human and C. elegans 
OTUB1 and of ubiquitin were purified according to the same protocol as the 
wild-type proteins. UBC13 and UEV1A were purified by resuspending cell pellets 
in lysis buffer (20 mM HEPES pH 7.3, 300mM NaCl, 10 mM imidazole, 2mM 
B-mercaptoethanol) after adding 0.1mM phenyl-methyl sulphonyl fluoride 
(PMSF). Cells were disrupted using a Microfluidizer (Microfluidics) and the lysate 
was centrifuged to remove cell debris. The lysate was subjected to immobilized 
metal affinity chromatography (IMAC) using 5ml His-Trap columns (GE 
Biosciences) developed with a linear imidazole gradient of 25-400 mM in 20 
column volumes. Fractions containing purified protein were pooled, SENP-2 
protease was added in a ratio of 1:100 to cleave off the His-SUMO-2 tag, and 
pooled fractions were dialysed overnight at 4°C against lysis buffer. Cleaved 
protein was then subjected to a second round of IMAC and the cleaved protein 
was collected from the flow-through. Proteins were then purified by gel filtration 
on a Superdex 75 column (GE Healthcare), dialysed into 20 mM HEPES, pH 7.3, 
150mM NaCl and 1mM dithiothreitol (DTT), concentrated and stored at 
—80°C. Proteins for crystallization, enzyme assays and binding studies were 
>98% pure as visualized on a Coomassie-stained gel. His,-tagged human 
OTUB1 used in pull-down assays was ~90% pure. 

Protein modifications. UBC13, UEV1A and UEV1 were labelled with fluorescein- 
5-maleimide (Invitrogen) as described in the manufacturer’s protocol. Ubiquitin 
aldehyde was prepared as described”. 

Preparation of UBC13~Ub conjugates. UBC13(C87S)~ Ub oxyester was pre- 
pared as previously described'’. The UBC13°“*~Ub covalent conjugate was 
prepared according to a modification of the protocol from ref. 11. Purified ubi- 
quitin containing the substitution G76C (Ub(G76C)) or G75C (Ub(G75C)) and 
UBC13 were dialysed separately overnight into 20mM sodium borate buffer, 
pH 8.0 and 2 mM TCEP (tris(2-carboxyethyl)phosphine), mixed in the proportion 
of 1mM Ub(G76C) or Ub(G75C) to 330 4M UBC13, and incubated on ice for 
15 min. A stock of 20 mM 1,3-dichloroacetone (DCA) was prepared in dimethyl- 
formamide (DMEF) and added to the conjugation reaction to a final concentration 
of 0.8 mM DCA. The reaction was stopped after 1h by addition of 10 mM B-mer- 
captoethanol. The coupling efficiency was approximately 50%. For the Ub(G76C) 
reaction, the mix was diluted tenfold with 10 mM Tris, pH 8, loaded onto a mono 
Q column (GE Healthcare) pre equilibrated with 10mM Tris, pH8. Free 
Ub(G76C) eluted in the flow-through and UBC13?“*~Ub eluted together with 
unconjugated UBC13 in 180mM NaCl in 20 mM Tris, pH 8. For the Ub(G75C) 
reaction, UBC13°“*~Ub(G75C) was purified by gel filtration on a Superdex 75 
column pre-equilibrated with 20 mM HEPES pH7.3, 100 mM NaCl and 2mM 
DTT. The separation efficiency was about 10% of the total amount of 
UBC13?°4~Ub(G75C) in the reaction mix. 

Purification of worm OTUB1-Ubal-UBC13°“*~Ub(G76C) quaternary com- 
plex. Worm OTUB1 was incubated on ice with Ubal ina 1:4 molar ratio for 15 min 
and added to the purified apo human UBC13 and UBC13?“*~Ub mixture such 
that UBC13?°4~ Ub was in twofold excess over worm OTUBI, as estimated by gel 
electrophoresis. The mixture was incubated for another 15 min on ice and loaded 
onto a Superdex 200 column (GE Healthcare) pre-equilibrated with 20 mM Tris, 
pH7.45, 150mM NaCl and 2mM DTT. The OTUB1-Ubal-UBC13?“~Ub 
complex eluted as a single peak and was concentrated to 10 mg ml ' and stored 
at —80°C. 


Crystallization. All crystals were grown by the hanging-drop vapour diffusion 
method at 20°C. A worm OTUB1-UBC13 complex was prepared by incubating 
worm OTUB1 and human UBC13 at a molar ratio of 1:1 and total protein con- 
centration of 26mg ml ‘ for 10 min at room temperature. Crystals were grown 
from a 1:1 mix of protein and well solution containing 100 mM sodium cacodylate, 
pH6.5 and 1 M trisodium citrate and appeared in about 2-3 days. Crystals were 
transferred to cryoprotectant consisting of well solution plus 20% ethylene glycol 
and then flash-frozen in liquid nitrogen. 

Crystals of the worm OTUB1-Ubal-UBC13°“*~Ub complex were grown 
from a 1:1 mix of complex (10mg ml‘) with well solution containing 100 mM 
Bis-Tris pH6.5, 23% PEG 3350 and 0.26-0.3M sodium chloride. Crystals 
appeared in about 1-2 days, were cryoprotected by well solution with added 
15% ethylene glycol and then flash-frozen in liquid nitrogen. 

Crystals of the human/worm OTUB1-Ubal-UBC13°“*~Ub complex were 
grown from a 1:1 mix of complex (10mg ml‘) with well solution containing 
100mM MES pH6.5, 21% PEG 10,000 and 0.1M sodium chloride. Crystals 
appeared in about 2-3 days, were cryoprotected by well solution with added 
15% ethylene glycol and then flash-frozen in liquid nitrogen 
Structure determination. Diffraction data were recorded at the GM/CA-CAT 
beamline 23-ID-D/B at the Advanced Photon Source under standard cryogenic 
conditions and processed with iMOSFLM” for worm OTUB1-human UBC13 
crystals and HKL2000** for the worm OTUB1-Ubal-UBC13°“*~Ub crystals. 
For the worm OTUB1-Ubal-UBC13°“*~Ub structure two data sets were col- 
lected from a single crystal, merged and processed with HKL2000™. All data were 
collected at a wavelength of 1.033 A. The structure of worm OTUBI-UBC13 was 
determined by molecular replacement with Phaser” using structures of UBC13 
(1J7D) and apo human OTUB1 (2ZFY). The structure of worm OTUB1-Ubal- 
UBC13?“*~Ub was determined by molecular replacement with Molrep”® using 
structures of the worm OTUB1-human UBC13 complex and ubiquitin (from 
2GMI). The initial molecular replacement search performed with the worm 
OTUB1-UBC13 complex located four complexes in the asymmetric unit. The 
resulting positions of the OTUB1-UBC13 complexes were then fixed and ubiquitin 
was used as search model to locate the four ubiquitin aldehydes in the crystal. The 
position of the worm OTUB1-Ubal-UBC13 complex was fixed and another search 
with ubiquitin (1-71) located two molecules of donor ubiquitin in the asymmetric 
unit. The structure of human/worm OTUB1-Ubal-UBC13°“"~ Ub was deter- 
mined by molecular replacement with Phaser using one complex of worm 
OTUB1-Ubal-human UBC13~Ub lacking the first 42 residues of worm OTUBI. 

All structures were subjected to multiple rounds of manual correction and 
refinement using COOT” and Phenix”. The final stages of refinement for the 
worm OTUBI-Ubal-UBC13°™~Ub complex and human/worm OTUBI- 
Ubal-UBC13°“*~Ub_ ternary complex were done using REFMAC5”. 
Simulated annealing omit maps were calculated with CNS* and used to verify 
selected portions of the model. 

The final model of worm OTUB1-human UBC13 complex includes residues 
38-275 of worm OTUB1 and 3-152 of human UBC13. The final model of worm 
OTUB1-Ubal-UBC13?“*~Ub includes four complexes in the asymmetric unit: 
two containing all four proteins (worm OTUBI, Ubal, human UBC13 and Ub) 
and two lacking the donor ubiquitin conjugated to UBC13. There is no density in 
any of the complexes corresponding to the five C-terminal amino acids of ubiqui- 
tin or to the DCA linkage, which connects ubiquitin to the human UBC13 active- 
site cysteine. The number of worm OTUBI N-terminal residues visible in the map 
differs among the four complexes as follows: complex 1, 28-275; complex 2, 31- 
275; complex 3, 36-276; complex 4, 38-276. The final model of the human/worm 
OTUB1-Ubal-UBC13°“*~Ub complex includes residues 20-275 of human/ 
worm OTUB1, 3-151 of human UBC13, 1-76 of Ubal and 1-72 of ubiquitin. 

Protein-protein interaction surfaces were analysed using the PISA server at EBI 
(http://www.pdbe.org/PISA) and manually inspected using COOT and PYMOL 
(http://www.pymol.org). Figures were generated with PYMOL. 

Fluorescence polarization binding assay. Fluorescein-labelled human UBC13 
(20nM) was incubated with increasing concentrations of human OTUB1 wild 
type or mutants in 20 mM Tris, pH 7.6, 150 mM NaCland 10 mM B-mercaptoethanol. 
Polarization measurements were recorded at 25°C with an ISS Chronos 
Fluorescence Lifetime Spectrometer at excitation and emission wavelengths of 
492 and 520nm, respectively. Binding data were analysed and Kg values were 
calculated by nonlinear regression in SigmaPlot (SPSS). 

UEV binding assay. UEV1A and UEV1 were fluorescein-labelled; all other proteins 
are unlabelled. The experiment was performed with an analytical Superdex 75 
column pre-equilibrated with 20 mM HEPES pH 7.3, 100mM NaCl and 2mM 
DTT. Absorbance was detected at 495nm to monitor the presence of labelled 
UEVI1A or UEVI. For each run, proteins were incubated for 20 min on ice 
before loading onto the column. The protein concentrations used in the different 
experiments were: Fig. le, UEV1A 20 1M, UBC13 40M and human OTUB1 
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100 uM; Fig. 1f, UEV1A 10 uM, UBC13~Ub 10 uM, human OTUB1 50 1M and 
Ubal 50 uM; Fig. 1g, UEV1 20M, UBC13~Ub 20M, human OTUB1(A37) 
100 4M and Ubal 100 uM; Fig. lh, UEV1 20 1M, UBC13~Ub 201M, human 
OTUB1 100 uM and ubiquitin 200 uM. 

In vitro ubiquitination assay. Ubiquitination assays were performed in 25 mM 
Tris-HCl (pH 8.0) buffer containing 0.1 mM DTT, 1mM ATP, 2.5mM MgCh, 
5 mM creatine phosphate, 0.3 units ml’ inorganic pyrophosphatase, and 0.3 units 
mI’ creatine kinase. Proteins in the amounts of 0.4 1M UBC13, 0.4 uM UEV1A 
and 5 uM ubiquitin were mixed with human OTUB1 (14M) or worm OTUBI1 
(15 uM). Reactions were initiated by the addition of 0.1 1M E1 enzyme, incubated 
at 37 °C, and stopped at different time points by adding denaturing SDS-PAGE 
loading dye containing B-mercaptoethanol (BME). For Fig. 1b, 0.5 uM human 
OTUBI was incubated with 0.5 uM Ubal for 15 min before addition to the reaction. 
Reaction products were separated on a 4-12% Bis-Tris NuPAGE (Invitrogen) gel 
and transferred to a polyvinylidene fluoride (PVDF) membrane. Membranes were 
denatured in a 6 M guanidine HCl, 20 mM Tris-HCl, pH 7.5, 1 mM PMSF, 5 mM 
B-mercaptoethanol solution for 30 min at 4°C and then washed extensively in 
Tris-buffered saline and Tween 20 (TBST). Membrane were blocked overnight at 
4°C with 5% BSA in TBST and incubated for 1h with ubiquitin antibody (P4D1 
Santa Cruz) 1:1,000 at room temperature followed by anti-mouse horseradish 
peroxidase (HRP)-conjugated secondary antibody. OTUB1 was detected with 
Coomassie brilliant blue or SimplyBlue SafeStain (Invitrogen). 

Pull-down assays. Ni°*-NTA beads were equilibrated in buffer A (50 mM phos- 
phate buffer pH8.0, 100mM NaCl, 5mM f-mercaptoethanol and 10mM 
Imidazole). 6X His-human OTUB1 (30 1g) was incubated with pre-equilibrated 
beads in 200 jl of buffer A for 30 min. Beads were washed with 400 ul buffer A and 
incubated with a mixture of human UBC13 and human UBC13(C87S)~ Ub with 
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and without the indicated concentration of free ubiquitin (2-100 11M) in 200 ul 
buffer A for 1h. Beads were washed with 400 pl buffer A for 10 min and eluted 
with 25 pl of buffer A plus 250mM imidazole. Eluates were analysed by gel 
electrophoresis and staining with Coomassie blue or SimplyBlue SafeStain 
(Invitrogen). The pull-down in Supplementary Fig. 2 was performed as above except 
for the addition of 6X His-human OTUBI (7 1g), human UBC13(C87S)~ Ub (7 1g) 
and ubiquitin as indicated in the figure. 
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The translational landscape of mTOR 
signalling steers cancer initiation 


and metastasis 
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Morris E. Feldman®, Jonathan S. Weissman®, Kevan M. Shokat®, Christian Rommel® & Davide Ruggero! 


The mammalian target of rapamycin (mTOR) kinase is a master regulator of protein synthesis that couples nutrient 
sensing to cell growth and cancer. However, the downstream translationally regulated nodes of gene expression that 
may direct cancer development are poorly characterized. Using ribosome profiling, we uncover specialized translation 
of the prostate cancer genome by oncogenic mTOR signalling, revealing a remarkably specific repertoire of genes 
involved in cell proliferation, metabolism and invasion. We extend these findings by functionally characterizing a 
class of translationally controlled pro-invasion messenger RNAs that we show direct prostate cancer invasion and 
metastasis downstream of oncogenic mTOR signalling. Furthermore, we develop a clinically relevant ATP site 
inhibitor of mTOR, INK128, which reprograms this gene expression signature with therapeutic benefit for prostate 
cancer metastasis, for which there is presently no cure. Together, these findings extend our understanding of how 
the ‘cancerous’ translation machinery steers specific cancer cell behaviours, including metastasis, and may be 


therapeutically targeted. 


It is unknown whether specialized networks of translationally con- 
trolled mRNAs can direct cancer initiation and progression, thereby 
mirroring cooperativity that has mainly been observed at the level 
of transcriptional control. This is an important question, as key 
oncogenic signalling molecules, such as the mTOR kinase, directly 
regulate the activity of general translation factors’. Downstream of 
the phosphatidylinositol-3-OH kinase (PI(3)K)-AKT signalling 
pathway, mTOR assembles with either raptor or rictor to form two 
distinct complexes: mTORC1 and mTORC2 (refs 3, 4). The major 
regulators of protein synthesis downstream of mTORC1 are 4EBP1 
(also called EIF4EBP1) and p70S6K1/2 (refs 1, 2). 4EBP1 negatively 
regulates eIF4E, a key rate-limiting initiation factor for cap-dependent 
translation. Phosphorylation of 4EBP1 by mTORCI leads to its dis- 
sociation from eIF4E, allowing translation initiation complex forma- 
tion at the 5’ end of mRNAs*. The mTOR-dependent phosphorylation 
of p70S6K1/2 also promotes translation initiation as well as elonga- 
tion’. At a genome-wide level, it remains poorly understood whether 
and how activation of these regulators of protein synthesis may pro- 
duce specific changes in gene expression networks that direct cancer 
development. Here we use a powerful new technology known as 
ribosome profiling to delineate the translational landscape of the can- 
cer genome at a codon-by-codon resolution upon pharmacological 
inhibition of mTOR’. Our findings provide genome-wide character- 
ization of translationally controlled mRNAs downstream of oncogenic 
mTOR signalling and delineate their functional roles in cancer 
development. Moreover, we determine the efficacy of a novel clinically 
relevant mTOR inhibitor that we developed, INK128, which specif- 
ically targets this cancer program. 


Ribosome profiling of the prostate cancer genome 
mTOR is deregulated in nearly 100% of advanced human prostate 
cancers*, and genetic findings in mouse models implicate mTOR 
hyperactivation in prostate cancer initiation’"’. Given the critical role 
for mTOR in prostate cancer, we used PC3 human prostate cancer 
cells, where mTOR is constitutively hyperactivated, to delineate trans- 
lationally controlled gene expression networks upon complete or 
partial mTOR inhibition. We optimized ribosome profiling to assess 
quantitatively ribosome occupancy genome-wide in cancer cells’. In 
brief, ribosome-protected mRNA fragments were deep-sequenced to 
determine the number of ribosomes engaged in translating specific 
mRNAs (Supplementary Fig. la and Methods). Treatment of PC3 
cells with PP242 (refs 12, 13), an mTOR ATP site inhibitor, signifi- 
cantly inhibits the activity of the three primary downstream mTOR 
effectors 4EBP1, p70S6K1/2 and AKT. On the contrary, rapamycin, 
an allosteric mTOR inhibitor, only blocks p70S6K1/2 activity in these 
cells (Supplementary Fig. 1b). We used short 3-h drug treatments, 
which precede alterations in de novo protein synthesis, to capture 
direct changes in mTOR-dependent gene expression by ribosome 
profiling and to minimize compensatory feedback mechanisms (Sup- 
plementary Fig. 1c-f). 

Ribosome profiling revealed 144 target mRNAs selectively 
decreased at the translational level upon PP242 treatment (log, =—1.5 
(false discovery rate <0.05)) compared to rapamycin treatment, with 
limited changes in transcription (Fig. 1a and Supplementary Figs 2a, b 
and 3-10). The fact that at this time point rapamycin treatment did 
not markedly affect gene expression is consistent with incomplete 
allosteric inhibition of mTOR activity (Supplementary Fig. 1b). By 
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Figure 1 | Ribosome profiling reveals mTOR-dependent specialized 
translational control of the prostate cancer genome. a, Representative 
comparison of mRNA abundance and translational efficiency after a 3-h 
treatment with an ATP site inhibitor (PP242) versus an allosteric inhibitor 
(rapamycin). b-d, Free energy, length and percentage G+C content of the 5’ 
UTRs of mTOR target versus non-target mRNAs (error bars indicate range, 
non-target n = 5,022, target n = 144, two-sided Wilcoxon). e, Functional 
classification of translationally regulated mTOR-responsive mRNAs. 

f, Chemical structure of INK128. g, Representative western blot from three 
independent experiments of mTOR-sensitive invasion genes in PC3 cells after a 
48-h drug treatment. Rapa, rapamycin. 


monitoring footprints of translating 80S ribosomes, our findings show 
that the effects of PP242 are largely at the level of translation initiation 
and not elongation (Supplementary Fig. 3). It has been proposed that 
mRNAs translationally regulated by mTOR may contain long 5’ 
untranslated regions (5’ UTRs) with complex RNA secondary 
structures. On the contrary, ribosome profiling revealed that mTOR- 
responsive 5’ UTRs possess less complex features (Fig. 1b-d), provid- 
ing a unique data set to investigate the nature of regulatory elements 
that render these mRNAs mTOR-sensitive. It has been previously 
shown that some mTOR translationally regulated mRNAs, most 
notably those involved in protein synthesis, possess a 5’ terminal 
oligopyrimidine tract (5’ TOP)'*"* that is regulated by distinct trans- 
acting factors'*'’”. Of the 144 mTOR-sensitive target genes, 68% 
possess a 5’ TOP. However, as the 5’ TOP is not present in all 
mTOR-sensitive mRNAs, we next asked whether other 5’ UTR 
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consensus sequences may exist. Strikingly, 63% of mTOR target 
mRNAs possess what we have termed a pyrimidine-rich translational 
element (PRTE) within their 5’ UTRs (P = 3.2 X 10 1"). This element, 
unlike the 5’ TOP sequence, consists of an invariant uridine at position 
6 flanked by pyrimidines and, importantly, does not reside at position 
+1 ofthe 5’ UTR (Supplementary Figs 2c and 7). We found that 89% of 
the mTOR-responsive genes possess a PRTE and/or 5’ TOP, making 
the presence of one or both sequences a strong predictor for mTOR 
sensitivity (Supplementary Figs 2d and 7). Notably, mRNA isoforms 
arising from distinct transcription start sites may possess both a 5’ 
TOP and a PRTE. Moreover, given the significant number of 
mRNAs that contain both the PRTE and 5’ TOP, a functional interplay 
may exist between these regulatory elements. Future studies are 
required to determine the regulatory logic for how these sequences 
either independently or coordinately confer mTOR responsiveness. 
Multiple cis-acting elements within specific 5’ UTRs could reflect 
regulation by distinct mTOR effectors. For example, our findings show 
that the PRTE imparts translational control specificity to 4EBP 1 activity 
(see below). 

Surprisingly, mTOR-sensitive genes stratify into unique functional 
categories that may promote cancer development and progression, 
including cellular invasion (P = 0.009), cell proliferation (P = 0.04), 
metabolism (P= 0.0002) and regulators of protein modification 
(P=0.01) (Fig. le). The largest fraction of mTOR-responsive 
mRNAs cluster into a node consisting of key components of the 
translational apparatus: 70 ribosomal proteins, 6 elongation factors, 
and 4 translation initiation factors (P= 7.5 X 10 *”) (Fig. le and 
Supplementary Fig. 5). Therefore, this class of mTOR-responsive 
mRNAs may represent an important regulon that sustains the elevated 
protein synthetic capacity of cancer cells. 

Notably, the second largest node of mTOR translationally regulated 
genes comprises bona fide cell invasion and metastasis mRNAs 
and putative regulators of this process (Fig. le). This group includes 
YB1 (Y-box binding protein 1; also called YBX1), vimentin, MTA1 
(metastasis associated 1) and CD44 (Supplementary Fig. 11a). YB1 
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Figure 2 | mTOR promotes prostate cancer cell migration and invasion 
through a translationally regulated gene signature. a, Matrigel invasion assay 
in PC3 cells: 6-h pre-treatment followed by 6h of cell invasion (n = 6, 
ANOVA). b, c, Migration patterns and average distance travelled by GFP- 
labelled PC3 cells during hours 3-4 and 6-7 of drug treatment (n = 34 cells per 
condition, ANOVA). d, Matrigel invasion assay in PC3 cells after 48 h of 
knockdown of YB1, MTA1, CD44, or vimentin followed by 24h of cell invasion 
(n = 7, t-test). e, Matrigel invasion assay in BPH-1 cells after 48 h of 
overexpression of YBI and/or MTA1, followed by cell invasion for 24h (n = 7, 
t-test). Rapa, rapamycin. All data represent mean + s.e.m. NS, not statistically 
significant. 
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regulates the post-transcriptional expression of a network of invasion 
genes'®. Vimentin, an intermediate filament protein, is highly upregu- 
lated during the epithelial-to-mesenchymal transition associated with 
cellular invasion’. MTA1, a putative chromatin-remodelling protein, 
is overexpressed in invasive human prostate cancer* and has been 
shown to drive cancer metastasis by promoting neoangiogenesis”. 
CD44 is commonly overexpressed in tumour-initiating cells and is 
implicated in prostate cancer metastasis”*. Consistent with their status 
as mTOR sensitive genes, YB1, vimentin, MTA1 and CD44 all possess a 
PRTE (Supplementary Fig. 5). Vimentin and CD44 also possess a 5’ 
TOP (Supplementary Fig. 7). To test the functional role of the PRTE in 
mediating translational control, we mutated the PRTE within the 5’ 
UTR of YB1, which rendered the YB1 5' UTR insensitive to inhibition 
by 4EBP1 (Supplementary Fig. 11b). These findings highlight a novel 
cis-regulatory element that may modulate translational control of 
subsets of mRNAs upon mTOR activation. Moreover, ribosome 
profiling reveals unexpected transcript-specific translational control, 
mediated by oncogenic mTOR signalling, including a distinct set of 
pro-invasion and metastasis genes. 


Translation of pro-invasion mRNAs by mTOR 

We next extended the use of the mTOR pharmacological tools used in 
ribosome profiling towards functional characterization of the newly 
identified mTOR-sensitive cell invasion gene signature. To this end, 
we developed a new clinical-grade mTOR ATP site inhibitor, INK128, 
derived from the PP242 chemical scaffold (Fig. 1f). In brief, a structure- 
guided optimization of pyrazolopyrimidine derivatives was performed 
(see INK128 chemical synthesis in Supplementary Information) that 
improved oral bioavailability while retaining mTOR kinase potency 
and selectivity. INK128 was selected for clinical studies on the basis 
of its high potency (1.4nM inhibition constant (K;)), selectivity for 
mTOR, low molecular mass, and favourable pharmaceutical properties 
(Supplementary Figs 12 and 13). 
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Figure 3 | The 4EBP1-eIF4E axis controls the post-transcriptional 
expression of mTOR-sensitive invasion genes. a, Schematic of the 
pharmacogenetic strategy to inhibit p70S6K1/2 or eIF4E hyperactivation. 

b, Representative western blot from three independent experiments of PC3 
4EBP1™ cells after 48-h doxycycline induction of 4EBP1™. c, Representative 
western blot from three independent experiments of PC3 cells after 48-h DG-2 
treatment. d, Representative western blot from three independent experiments 
of PC3 cells after 48h of 4EBP1/4EBP2 knockdown followed by 24-h INK128 
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Using either PP242 or INK128, we observed a selective decrease in 
the expression of YB1, MTAI, vimentin and CD44 at the protein but 
not transcript level in PC3 cells starting at 6h of treatment, which 
precedes any decrease in de novo protein synthesis (Fig. 1g and 
Supplementary Figs 1c, d, 14 and 15). In contrast, rapamycin treatment 
did not alter their expression (Fig. 1g and Supplementary Fig. 14a). 
Similar findings were observed using a broad panel of metastatic cell 
lines of distinct histological origins (Supplementary Fig. 16). The four- 
gene invasion signature is positively regulated by mTOR hyperactiva- 
tion, as silencing PTEN expression increased their protein but not 
mRNA expression levels (Supplementary Fig. 17). We next investi- 
gated the effects of mTOR ATP site inhibitors on prostate cancer cell 
migration and invasion. We found that INK128, but not rapamycin, 
decreases the invasive potential of PC3 prostate cancer cells (Fig. 2a). 
Furthermore, INK128 inhibits cancer cell migration starting at 6h of 
treatment, precisely correlating with when decreases in the expression 
of pro-invasion genes are evident, but preceding any changes in the cell 
cycle or overall global protein synthesis (Fig. 2b, c, and Supplementary 
Figs 1c, e, f, 14b and 18). 

Among the genes comprising the pro-invasion signature, YB1 has 
been shown to act directly as a translation factor that controls expres- 
sion of a larger set of genes involved in breast cancer cell invasion”. 
Notably, YB1 translationally regulated target mRNAs, including 
SNAIL1 (also called SNA), LEF1 and TWIST1, decreased at the protein 
but not transcript level upon YB1 knockdown in PC3 cells (Sup- 
plementary Figs 19 and 20). To determine the functional role of 
YB1 in prostate cancer cell invasion, we silenced YB1 gene expression 
in PC3 cells, and observed a 50% reduction in cell invasion (Fig. 2d). 
Similarly, knockdown of MTA1, CD44, or vimentin also inhibited 
prostate cancer cell invasion (Fig. 2d and Supplementary Fig. 19). 
These mTOR target mRNAs may be sufficient to endow primary 
prostate cells with invasive features, as overexpression of YBI and/or 
MTAI (Supplementary Fig. 21a) in BPH-1 cells, an untransformed 
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treatment (see quantification of independent experiments in Supplementary 
Fig. 23a). e, Representative western blot from three independent experiments of 
wild type (WT) and 4EBP1/4EBP2 double knockout (DKO) MEFs treated with 
INK128 for 24h. f, Representative western blot from two independent 
experiments of wild-type and mSin1/~ (also called Mapkap1""""*") MEFs 
after 24-h INK128 treatment. g, Matrigel invasion assay upon 48-h doxycycline 
induction of 4EBP1™, or treatment with DG-2 compared to control (n = 6 per 
condition, t-test). All data represent mean + s.e.m. 
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prostate epithelial cell line, increased the invasive capacity of these cells 
in an additive manner (Fig. 2e). Notably, the effects of YB1 and MTA1 
on cell invasion are independent from any effect on cell proliferation in 
both knockdown or _ overexpression studies (Supplemen- 
tary Fig. 21b, c). Therefore, translational control of pro-invasion 
mRNAs by oncogenic mTOR signalling alters the ability of epithelial 
cells to migrate and invade, a key feature of cancer metastasis. 


Dissecting mTOR translational effectors 


We sought to determine the molecular mechanism by which pro- 
invasion genes are regulated at the translational level and why these 
mRNAs are sensitive to INK128 but not rapamycin. To this end, we 
investigated whether the translational regulators downstream 
mTORCI1, 4EBP1 and/or p70S6K1/2, control the expression of these 
mTOR-sensitive targets. We generated a human prostate cancer cell 
line that stably expresses a a doxycycline- inducible dominant-negative 
mutant of 4EBP1 (4EBP1™) (Fig. 3a)'*. This mutant binds to eIF4E, 

decreasing its hyperactivation without inhibiting general mTORC1 
function (Supplementary Fig. 22a). Notably, expression of 4EBP1™ 
does not alter global protein synthesis (Supplementary Fig. 22b), 
probably because endogenous 4EBP1 and 4EBP2 proteins retain their 
ability to to bind to elF4E (Supplementary Fig. 22c)'*. Upon induction of 
4EBP1™, YB1, vimentin, CD44 and MTAI decrease at the protein but 
not mRNA level, whereas pharmacological inhibition of p70S6K1/2 
with DG-2 (ref. 23) had no effect (Fig. 3b, cand Supplementary Fig. 22d). 
Next, we tested whether INK128 decreases expression of the four 
invasion genes through the 4EBP-eIF4E axis. Notably, knockdown 
of 4EBP1 and 4EBP2 in PC3 cells or using 4EBP1 and 4EBP2 double 
knockout mouse embryonic fibroblasts (MEFs)** reduced the ability of 
INK128 to decrease expression of these pro-invasion mRNAs (Fig. 3d, e 
and Supplementary Fig. 23). Furthermore, ablation of mTORC2 
activity~* had no effect on the expression of these mRNAs or respon- 
siveness to INK128 (Fig. 3f and Supplementary Fig. 24a—c). Next, we 
determined the effect of 4EBP1™ on human prostate cancer cell inva- 
sion. The expression of 4EBP1™ resulted in a significant decrease in 
prostate cancer cell invasion without affecting the cell cycle, whereas 
DG-2 had no effect (Fig. 3g and Supplementary Fig. 24d). These 
findings demonstrate that eIF4E hyperactivation downstream of 
oncogenic mTOR regulates translational control of the pro-invasion 
mRNAs and provides an explanation for the selective targeting of this 
gene signature by mTOR ATP site inhibitors. 


Examining cell invasion networks in vivo 


Both CK5* and CK8* prostate epithelial cells have been im iplicated i in 
the initiation of prostate cancer upon loss of PTEN’®””. Pten'*?/?; Ph- 

cre (Pten’’") mice are an ideal model of prostate cancer because they 
display distinct stages of cancer development (prostatic intraepithelial 
neoplasia, invasive adenocarcinoma, and metastasis)’**. However, the 
expression patterns of YB1, vimentin, CD44 and MTA1 in prostate 
basal (CK5~) and luminal (CK8~) epithelial cells have not been char- 
acterized. We therefore analysed their expression patterns in the 
Pten!” prostate cancer mouse model, where mTOR is constitutively 
hyperactivated””*. We found that YB1 localizes to the cytoplasm and 
nucleus of CK5* and CK8* prostate epithelial cells, consistent with its 
ability to shuttle between the two cellular compartments (Fig. 4a, b and 
Supplementary Fig. 25a, b)'*”?. MTA1 expression is exclusively nuclear 
in both cell types (Fig. 4c, d). Of note, CD44, together with other cell- 
surface markers, has been used to isolate a rare prostate stem-cell 
population*®. We observed expression of CD44 within a subset of 
CK5* and CK8* epithelial cells (Fig. 4e, f). In contrast, vimentin is 
not detected in either cell type (Fig. 4g). We next determined the impact 
of mTOR hyperactivation on the expression pattern of the pro-invasion 
gene signature. YB1, MTA1 and CD44 protein, but not transcript, levels 
were significantly increased in both Pten'” luminal and basal epithelial 
cells compared to wild type (Fig. 4h and Supplementary Fig. 25c-e). 

Interestingly, a subset of Pten'’” luminal epithelial cells ectopically 
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expresses vimentin at aberrantly high levels, with a perinuclear distri- 
bution (Fig. 4g and Supplementary Fig. 25f, g) suggesting that these 
cells may have acquired some mesenchymal-like features. Consistent 
with these findings, perinuclear vimentin localization is associated 
with invasive features of human prostate cancer cells*' and changes 
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Figure 4 | mTOR hyperactivation augments translation of YB1, MTA1, 
CD44 and vimentin mRNAs in a subset of pre-invasive prostate cancer cells 
in vivo. Left: immunofluorescent images of CK8/DAPI or CK5/DAPI with 
YB1 (a, b), MTAI (c, d), or CD44 (e, f) co-staining in 14-month-old wild-type 


and Pten’”” mouse prostate epithelial cells. White boxes outline the area 
magnified in the right panel. Right: magnified immunofluorescent images of 
YB1 (a, b), MTAI (¢, d) and CD44 (e, f) co-stained with DAPI in wild-type and 
Pten’”” mouse prostate epithelial cells. Dotted lines encircle the cytoplasm (C) 
and/or the nucleus (N). g, Representative immunofluorescent images of CK5 or 
CK8 co-staining with vimentin in 14-month-old wild-type and Pten’”” mouse 
prostate epithelial cells. S, stroma; yellow arrows indicate perinuclear vimentin. 
h, Box plot of YB1 (N = nuclear, C = cytoplasmic), MTA1 and CD44 mean 
fluorescence intensity (m.f.i.) per CK5* or CK8™ prostate epithelial cell in wild- 
type and Pten'" mice (three mice per arm, n = 43-303 cells quantified per 
target gene, error bars indicate range (see Supplementary Fig. 25b); 

*P < 0.0001, **P = 0.0004, t-test). 
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in cell polarity in actively moving fibroblasts**. These studies reveal a 
unique, translationally controlled signature of gene expression down- 
stream of mTOR hyperactivation in a cancer-initiating subset of pro- 
state epithelial cells. 


Targeting prostate cancer metastasis 

The most significant pre-clinical extension of this work would be to 
determine the therapeutic benefit of INK128 in reprogramming 
expression of the mTOR-dependent pro-invasion gene signature 
and prostate cancer metastasis directly in vivo. This is underscored 
by the clinical inefficacy of allosteric mTOR inhibition towards the 


ARTICLE 


lethal form of metastatic human prostate cancer****. Importantly, in 
our preclinical trial of RADOO1 (rapalog) versus INK128 in Pten’”” 
mice, 4EBP1 and p70S6K1/2 phosphorylation was completely 
restored to wild-type levels after treatment with INK128, whereas 
RADOO1 only decreased p70S6K1/2 phosphorylation _ levels 
(Supplementary Fig. 26a, b). We next determined the cellular con- 
sequences of complete versus partial mTOR inhibition during distinct 
stages of prostate cancer. INK128 treatment resulted in a 50% 
decrease in prostatic intraepithelial neoplasia (PIN) lesions in 
Pten’’” mice that was associated with decreased proliferation and a 
tenfold increase in apoptosis (Supplementary Fig. 26d-f). Notably, 
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Figure 5 | Complete mTOR inhibition by INK128 treatment prevents 
prostate cancer invasion and metastasis in vivo. a, Diagram and images of 
normal prostate gland, pre-invasive PIN and invasive prostate cancer. CK8/ 
CKS5, luminal/basal epithelial cells, respectively. Yellow arrowheads indicate 
invasive front. b, Immunofluorescent images of 14-month-old Pten’”” lymph 
node (LN) metastasis co-stained with CK8/androgen receptor (AR), CK8/YB1 
and CK8/MTA1. ¢, Left: human tissue microarray of YB1 protein levels in 
normal (n = 59), PIN (n =5), cancer (n = 99) and CRPC (n = 3) (ANOVA). 
Right: immunohistochemistry of YB1 in human CRPC demarcated by the red 
line (inset shows nuclear and cytoplasmic YB1). d, Quantification of invasive 


prostate glands in wild-type and Pten"” mice before (12-months old) and after 


(14-months old) 60 days of INK128 treatment (7 = 6 mice per arm, ANOVA). 
e, f, Area and number of CK8/AR* metastases in draining lymph nodes in 14- 
month-old Pten’" mice after 60 days of INK128 treatment (n = 6 mice per 
arm, t-test). g, Percentage decrease of YB1 (N = nuclear, C = cytoplasmic), 
MTAI, CD44, or vimentin protein levels (determined by quantitative 
immunofluorescence, Supplementary Fig. 25b) in CK8* or CK5™ prostate cells 
(CKs* only for vimentin) in INK128-treated 14-month-old Pten'” mice 
normalized to vehicle-treated mice (n = 3 mice per arm, t-test). All data 
represent mean + s.e.m. 
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the unique cytotoxic properties of INK128 treatment in Pten'”” mice 
were evidenced by a marked reduction in prostate cancer volume. In 
addition, and consistent with these findings, INK128 induced pro- 
grammed cell death in multiple cancer cell lines (Supplementary Fig. 
27a, b). In contrast, RADOO1 treatment mainly had cytostatic effects 
leading to only partial regression of PIN lesions associated with a 
limited decrease in cell proliferation and no significant effect on apop- 
tosis (Supplementary Fig. 26c-f). 

We extended the preclinical trial by examining the effects of 
INK128 treatment on the pro-invasion gene signature and prostate 
cancer metastasis, which is incurable and the primary cause of patient 
mortality. Cell invasion is the critical first step in metastasis, required 
for systemic dissemination. In Pten”” mice after the onset of PIN, a 
subset of prostate glands show characteristics of luminal epithelial cell 
invasion by 12 months (Fig. 5a and Supplementary Fig. 27c)**. After 
12 months of age, Pten’’” mice develop lymph-node metastases and 
these cells maintain strong YB1 and MTA] expression (Fig. 5b). We 
further extended these findings directly to human prostate cancer 
patient specimens, observing that YB1 expression levels increase in 
a stepwise fashion from normal prostate to castration-resistant pro- 
state cancer (CRPC), an advanced form of the disease associated with 
increased metastatic potential (Fig. 5c). MTA1 levels exhibit similar 
increases”. In human prostate cancer, high-grade primary tumours 
that display invasive features are more likely to develop systemic 
metastasis than low-grade non-invasive tumours***. Remarkably, 
treatment with INK128 completely blocked the progression of invas- 
ive prostate cancer locally in the prostate gland, and profoundly inhib- 
ited the total number and size of distant metastases (Fig. 5d-f). This 
was associated with a marked decrease in the expression of YB1, 
vimentin, CD44 and MTA1 at the protein, but not transcript, level 
in specific epithelial cell types within pre-invasive PIN lesions in 
Pten'”’ mice (Fig. 5g and Supplementary Fig. 25c). Together, these 
findings reveal an unexpected role for oncogenic mTOR signalling in 
control of a pro-invasion translational program that, along with the 
lethal metastatic form of prostate cancer, can be efficiently targeted 
with clinically relevant mTOR ATP site inhibitors. 


Discussion 


Here we used ribosome profiling to generate a comprehensive map of 
translationally controlled mTOR targets in cancer that surprisingly 
stratify into specific cellular processes including proliferation, meta- 
bolism, protein synthesis and invasion (Fig. le). The effects of this 
translational control program are probably broad, converging on 
many subclasses of mRNAs that may cooperate towards distinct steps 
in cancer development and therapeutic response. This is supported by 
our in vivo findings where we developed a novel clinically relevant 
mTOR inhibitor, INK128, that significantly abrogates multiple 
aspects of prostate cancer development by inducing apoptosis as well 
as inhibiting cell proliferation, invasion and metastasis (Fig. 5d-g and 
Supplementary Fig. 26c-f). The superiority of INK128 as an mTOR 
inhibitor is also evident in its ability to reprogram the mTOR onco- 
genic translational program in prostate cancer, which is not achieved 
by rapalog treatment. Of note, however, the sensitivity of cells from 
distinct histological origins to ATP site versus allosteric inhibitors of 
mTOR may differ. For example, the Jurkat lymphoid cell line is par- 
ticularly sensitive to rapamycin treatment”. 

One of the most novel nodes of mTOR translationally controlled 
genes are those that cooperatively control, at least in part, the cellular 
invasive features of human prostate cancer cells (Figs 1g, 2 and 3b, g). 
Translational control of these mRNAs relies on the 4EBP 1-eIF4E axis 
and is thereby specifically druggable with potent mTOR ATP site 
inhibitors, which, unlike rapamycin, target mTOR-dependent 
4EBP1 phosphorylation (Figs 1g, 3d, e and 5g, and Supplementary 
Figs 1b, 23 and 26b). This has significant therapeutic implications not 
only for advanced prostate cancer but also for multiple metastatic 
cancers where we show that translational control of pro-invasion 
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mRNAs is sensitive to ATP site inhibitors of mTOR (Supplemen- 
tary Fig. 16). Thereby, these studies link translational regulation to 
the poorly understood mechanisms underlying cancer metastasis. 
Intriguingly, various components of the translation machinery, includ- 
ing oncogenic eIF4E”, localize to the leading edge of migrating fibro- 
blasts. This may allow spatially controlled translation of mRNAs 
important for cell migration, providing a rapid and specific response 
in transducing a migration program that could be co-opted at the 
invasive edge of metastatic cancer cells. Together, these studies reveal 
that the ability of mTOR to phosphorylate general translation factors 
results in exquisite transcript-specific translational control of key 
mRNAs that may cooperate in distinct steps of cancer initiation and 
progression, with significant implications for therapeutic intervention. 


METHODS SUMMARY 


Mice. Pten'°*?/"°*? and Pb-cre mice were obtained from Jackson Laboratories and 
Mouse Models of Human Cancers Consortium (MMHCC) and maintained in the 
C57BL/6 background. 

Ribosome profiling. PC3 lysates were subjected to ribosome footprinting by 
nuclease treatment. Ribosome-protected and alkaline digested poly(A) mRNA 
fragments were purified and deep sequencing libraries were generated. Ribosome 
footprint and RNA-seq sequencing reads were aligned against a library of tran- 
scripts from the UCSC Known Genes database GRCh37/hg19. Read density 
profiles were constructed for the canonical transcript of each gene. The average 
read density per codon was computed for the coding sequence of each transcript. 
Average read density was used to determine mRNA abundance (RNA-seq reads), 
ribosome occupancy of mRNAs (foot print reads), and translational efficiency 
(foot print reads/RNA-seq reads). 

Immunofluorescence. Paraffin-embedded mouse prostates and lymph nodes 
were deparaffinized and rehydrated using CitriSolv (Fisher) and serial ethanol 
washes. Antigen unmasking was performed using Citrate pH 6 (Vector Labs). 
Sections were blocked in 5% goat serum, 1% BSA in TBS. Various primary 
antibodies were used at dilutions between 1:50 and 1:500 (see Methods), followed 
by incubation with appropriate conjugated secondary antibodies. Samples were 
mounted with DAPI Hardset Mounting Medium (Vector Lab). A Zeiss Spinning 
Disc confocal (Zeiss, CSU-X1) was used to image the tissues. Individual cells were 
quantified for mean fluorescence intensity using the Axiovision (Zeiss, Release 
4.8) densitometric tool. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 


Mice. Pten!?*?/"°*? and Pb-cre mice where obtained from Jackson Laboratories and 
Mouse Models of Human Cancers Consortium (MMHCC), respectively, and 
maintained in the C57BL/6 background. Mice were maintained under specific 
pathogen-free conditions, and experiments were performed in compliance with 
institutional guidelines as approved by the Institutional Animal Care and Use 
Committee of UCSF. 

Cell culture and reagents. Human cell lines were obtained from the ATCC and 
maintained in the appropriate medium with supplements as suggested by ATCC. 
Wild-type, mSin1~‘~ (provided by B. Su), and 4EBP1/4EBP2 double knockout 
MEFs (provided by N. Sonenberg) were cultured as previously described**”’. 
SMARTvector 2.0 (Thermo Scientific) lentiviral shRNA constructs were used 
to knock down PTEN (SH-003023-02-10). For generation of GFP-labelled PC3 
cells, SMARTvector 2.0 lentiviral empty vector control particles that contain 
TurboGFP (S-004000-01) were used. Control (D-001810-01), YBI (L-010213), 
MTAI (L-004127), CD44 (L-009999), vimentin (L-003551), rictor (LL-016984), 
4EBP1 (L-003005) and 4EBP2 (L-018671) pooled siRNAs were purchased from 
Thermo Scientific. Intellikine provided INK128 and PP242, which were used at 
200 nM and 2.5 uM in cell-based assays unless otherwise specified. RAD0O01 was 
obtained from LC Laboratories. DG-2 was provided by K. Shokat and used at 
20 uM in cell-based assays. Rapamycin was purchased from Calbiochem and used 
at 50nM in cell-based assays. Doxycyline (Sigma) was used at 1 gml' in 
4EBP1™ induction assays. Lipofectamine 2000 (Invitrogen) was used to transfect 
cancer cell lines with siRNA. Amaxa Cell Line Nucleofector Kit R (Lonza) was 
used to electroporate BPH-1 cells with over expression vectors. The 4EBP1™ has 
been previously described’’. 

Plasmids. pcDNA3-HA-YB1 was provided by V. Evdokimova. pCMV6-Myk- 
DDK-MTAI was purchased from Origene. pGL3-Promoter was purchased from 
Promega. To clone the 5’ UTR of YB1 into pGL3-Promoter, the entire 5’ UTR 
sequence of YB1 was amplified from PC3 cDNA. PCR fragments were digested 
with HindIII and NcoI and ligated into the corresponding sites of pGL3- 
Promoter. The PRTE sequence at position +20-34 in the YB1 5’ UTR (UCSC 
kgID uc001chs.2) was mutated using the QuikChange Site-Directed Mutagenesis 
Kit following the manufacturer’s protocol (Stratagene). 

Ribosome profiling. PC3 cells were treated with rapamycin (50nM; 
Calbiochem) or PP242 (2.5 1M; Intellikine) for 3h. Cells were subsequently 
treated with cycloheximide (100 1g ml '; Sigma) and detergent lysis was per- 
formed in the dish. The lysate was treated with DNase and clarified, and a sample 
was taken for RNA-seq analysis. Lysates were subjected to ribosome foot printing 
by nuclease treatment. Ribosome-protected fragments were purified, and deep 
sequencing libraries were generated from these fragments, as well as from poly(A) 
mRNA purified from non-nuclease-treated lysates. These libraries were analysed 
by sequencing on an Illumina GAII. 

Each sequencing run resulted in approximately 20-25 million raw reads per 
sample, of which 5-12 million unique reads were used for subsequent analysis. 
Ribosome footprint and RNA-seq sequencing reads were aligned against a library 
of transcripts from the UCSC Known Genes database GRCh37/hg19. The first 25 
nucleotides of each read were aligned using Bowtie and this initial alignment was 
then extended to encompass the full fragment-derived portion of the sequencing 
read while excluding the linker sequence. Read density profiles were then con- 
structed for the canonical transcript of each gene, using only reads with 0 or 1 total 
mismatches between the read sequence and the reference sequence, comprised of 
the transcript fragment followed by the linker sequence. Footprint reads were 
assigned to an A site nucleotide at position + 15 to +17 of the alignment, based on 
the total fragment length; mRNA reads were assigned to the first nucleotide of the 
alignment. The average read density per codon was then computed for the coding 
sequence of each transcript, excluding the first 15 and last 5 codons, which can 
display atypical ribosome accumulation. 

Average read density was used as a measure of mRNA abundance (RNA-seq 
reads) and of protein synthesis (ribosome profiling reads). For most analyses, 
genes were filtered to require at least 256 reads in the relevant RNA-seq samples. 
Translational efficiency was computed as the ratio of ribosome footprint read 
density to RNA-seq read density, scaled to normalize the translational efficiency 
of the median gene to 1.0 after excluding regulated genes (log fold-change +1.5 
after normalizing for the all-gene median). Changes in protein synthesis, mRNA 
abundance and translational efficiency were similarly computed as the ratio of 
read densities between different samples, normalized to give the median gene a 
ratio of 1.0. This normalization corrects for differences in the absolute number of 
sequencing reads obtained for different libraries. 3,977 (replicate 1), and 5,333 
(replicate 2) unique mRNAs passed a preset read threshold of 256 reads for single- 
gene quantification for all treatment conditions. 

Western blot analysis. Western blot analysis was performed as previously 
described"? with antibodies specific to phospho-AKT™”* (Cell Signaling), AKT 


(Cell Signaling), phospho-p70S6K™**? (Cell Signaling), phospho-rpS6*40?4 
(Cell Signaling), rpS6 (Cell Signaling), phospho-4EBP1'*”“° (Cell Signaling), 
4EBP1 (Cell Signaling), 4EBP2 (Cell Signaling), YB1 (Cell Signaling), CD44 
(Cell Signaling), LEF1 (Cell Signaling), PTEN (Cell Signaling), eEF2 (Cell 
Signaling), GAPDH (Cell Signaling), vimentin (BD Biosciences), eIF4E (BD 
Biosciences), Flag (Sigma), B-actin (Sigma), MTA1 (Santa Cruz Biotechnology), 
Twist (Santa Cruz Biotechnology), rpL28 (Santa Cruz Biotechnology), HA 
(Covance) and rictor (Bethyl Laboratory). 

qPCR analysis. RNA was isolated using the manufacturer’s protocol for RNA 
extraction with TRIzol Reagent (Invitrogen) using the Pure Link RNA mini kit 
(Invitrogen). RNA was Dnase-treated with Pure Link Dnase (Invitrogen). Dnase- 
treated RNA was transcribed to cDNA with SuperScript III First-Strand Synthesis 
System for RT-PCR (Invitrogen), and 1 1l of cDNA was used to run a SYBR green 
detection qPCR assay (SYBR Green Supermix and MyiQ2, Biorad). Primers were 
used at 200 nM. 

5’ UTR analysis. 5’ UTRs of the 144 downregulated mTOR target genes were 
obtained using the known gene ID from the UCSC Genome Browser (GRCh37/ 
hg19). Target versus non-target mRNAs were compared for 5’ UTR length, 
%G+C content and Gibbs free energy by the Wilcoxon two-sided test. 
Multiple £,, (expectation maximization) for Motif Elicitation (MEME) and 
Find Individual Motif Occurrences (FIMO) was used to derive the PRTE and 
determine its enrichment in the 144 mTOR-sensitive genes compared a back- 
ground list of 3,000 genes. The Database of Transcriptional Start Sites (DBTSS 
Release 8.0) was used to identify putative 5’ TOP genes and putative transcription 
start sites in the 144 mTOR target genes. 

Luciferase assay. PC3 4EBP1™ cells were treated with 1 pg ml! doxycycline 
(Sigma) for 24h. Cells were transfected with various pGL3-Promoter constructs 
using lipofectamine 2000 (Invitrogen). After 24h, cells were collected. 20% of the 
cells were aliquoted for RNA isolation. The remaining cells were used for the 
luciferase assay per the manufacturer’s protocol (Promega). Samples were mea- 
sured for luciferase activity on a Glomax 96-well plate luminometer (Promega). 
Firefly luciferase activity was normalized to luciferase mRNA expression levels. 
Kinase assays. mTOR activity was assayed using LanthaScreen Kinase kit 
reagents (Invitrogen) according to the manufacturer’s protocol. PI(3)K «, B, Y 
and 6 activity were assayed using the PI(3)K HTRE assay kit (Millipore) accord- 
ing to the manufacturer’s protocol. The concentration of INK128 necessary to 
achieve inhibition of enzyme activity by 50% (ICs) was calculated using con- 
centrations ranging from 201M to 0.1nM (12-point curve). ICs9 values were 
determined using a nonlinear regression model (GraphPad Prism 5). 

Cell proliferation assay. PC3 cells were treated with the appropriate drug for 48 h, 
and proliferation was measured using CellTiter-Glo Luminescent reagent (Promega) 
per the manufacturer’s protocol. The concentration of INK128 necessary to achieve 
inhibition of cell growth by 50% (IC;9) was calculated using concentrations ranging 
from 20.0 1M to 0.1 nM (12-point curve). 

Mouse xenograft study. Nude mice were inoculated subcutaneously in the right 
subscapular region with 5 X 10° MDA-MB-361 cells. After tumours reached a 
size of 150-200 mm’, mice were randomly assigned into vehicle control or treat- 
ment groups. INK128 was formulated in 5% polyvinylpropyline, 15% NMP, 80% 
water and administered by oral gavage at 0.3 mg kg ' and 1 mgkg™! daily. 
Pharmacokinetic analysis. The area under the plasma drug concentration versus 
time curves, AUCg_ y,,) and AUC(o_ inp, were calculated from concentration data 
using the linear trapezoidal rule. The terminal t/2 in plasma was calculated from 
the elimination rate constant (/z), estimated as the slope of the log-linear terminal 
portion of the plasma concentration versus time curve, by linear regression ana- 
lysis. The bioavailability (F) was calculated using F = (AUC(o-tast),poDiv.)/ 
(AUC(o-Iast),ivDp.o.) X 100%, where Dj. and D,.o, are intravenous and oral doses, 
respectively. Cmax was a highest drug concentration in plasma after oral admin- 
istration. Tmax was the time at which C,,,, is observed after extravascular admin- 
istration of drug. Tjast was the last time point a quantifiable drug concentration can 
be measured. 

Metabolic stability assay. In vitro metabolic stability of INK128 was evaluated 
after incubation with liver microsomes or liver S9 fractions from various species 
in the presence of NADPH. The half-life of INK128 was estimated by log linear 
regression analysis. 

CYP assay. INK128 inhibition of CYP450 isoforms in human liver microsomes 
was determined with isoform-specific substrates at concentrations approximately 
equal to the concentration at which the rate of the reaction is half-maximal (K,,) 
for the individual isoforms: CYP1A2, CYP2C8, CYP2C9, CYP2C19, CYP2D6 
and CYP3A4. 

Pharmaceutical property assays. The percentage of protein binding of INK128 
was determined in mouse, rat, dog, monkey and human plasma at CEREP. The ICs 
for the inhibitory effect of INK128 on hERG potassium channel was determined at 
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CEREP. A Bacterial Reverse Mutation Assay (Ames test) was conducted at 
BioReliance. 

Polysome analysis. PC3 cells were treated for 3h with either DMSO or INK128 
(100 nM). Cells were re-suspended in PBS containing 100 1g ml ' cycloheximide 
(Sigma) and incubated on ice for 10 min. Cells were centrifuged at 300g for 5 min at 
4 °C and lysed in 10 mM Tris-HCl pH 8, 140 mM NaCl, 5 mM MgCh, 640 U ml! 
Rnasin, 0.05% NP-40, 250 ug ml! cycloheximide, 20mM DTT and protease 
inhibitors. Samples were incubated for 20 min on ice then centrifuged once for 
5 min at 3,300g and once for 5 min at 9,300g, isolating the supernatant after each 
centrifugation. Lysates were loaded onto 10-50% sucrose gradients containing 
0.1mgml | heparin and 2mM DTT and centrifuged at 37,000 r.p.m. for 2.5h 
at 4°C. The sample was subsequently fractionated on a gradient fractionation 
system (ISCO). RNA was extracted from all fractions and run on a TBE-agarose 
gel to visualize 18S and 28S rRNA. Fractions 7-13 were found to correspond to the 
polysome fractions and were used for further qPCR analysis. 

[?°S] metabolic labelling. PC3 or PC3 4EBP1™ cells with or without indicated 
treatment were incubated with 30,Ci of [*°S]-methionine for 1h after pre- 
incubation in methionine-free DMEM (Invitrogen). Cells were prepared using 
a standard protein lysate protocol, resolved on a 10% SDS polyacrylamide gel and 
transferred onto a PVDF membrane (Biorad). The membrane was exposed to 
autoradiography film (Denville) for 24h and developed. 

Cell cycle analysis. Appropriately treated PC3, BPH-1, or PC3-4EBP1™ cells 
were fixed in 70% ethanol overnight at —20 °C. Cells were subsequently washed 
with PBS and treated with RNase (Roche) for 30 min. After this incubation, the 
cells were permeabilized and treated with 50 1g ml’ propidium iodide (Sigma) 
in a solution of 0.1% Tween, 0.1% sodium citrate. Cell cycle data was acquired 
using a BD FACS Caliber (BD Biosciences) and analysed with FlowJo (v.9.1). 
Apoptosis analysis. Appropriately treated LNCaP and A498 cells were labelled 
with Annexin V-FITC (BD Biosciences) and propidium iodide (Sigma) following 
the manufacturer’s instructions. PI/Annexin data was acquired using a BD FACS 
Caliber (BD Biosciences) and analysed with FlowJo (v.9.1). 

Matrigel invasion assay. BioCoat Matrigel Invasion Chambers (modified 
Boyden Chamber Assay; BD Biosciences) were used according to the manufac- 
turer’s instructions. 

Real-time imaging of cell migration. Real-time imaging of GFP-labelled PC3 
cells was performed in poly-p-lysine-coated chamber cover glass slides (Lab-Tek). 
PC3 GFP cells were plated and allowed to adhere for 24h. Wells were wounded 
with a P200 pipette tip. The chamber slides were imaged with an IX81 Olympus 
wide-field fluorescence microscope equipped with a CO, and temperature con- 
trolled chamber and time-lapse tracking system. Images from DIC and GFP 
channels were taken every 2 min and processed using Image] (http://rsb.info.nih. 
gov/ij/) and analysed for cell migration with Manual Tracking (http://rsbweb.nih. 
gov/ij/plugins/track/track.html), using local maximum centring correction to 
maintain a centroid xy coordinate for each cell per frame over time. Tracking data 
was subsequently processed with the Chemotaxis and Migration tool from ibidi 
(http://www. ibidi.de/applications/ap_chemo.html) to create xy coordinate plots, 
velocity and distance measurements. 

Snaill immunocytochemistry. Appropriately transfected or treated PC3 cells 
were plated on a poly-L-lysine-coated chamber slide (Lab-Tek) and cultured for 
48h. Cells were fixed with 4% paraformaldehye (EMS), rinsed with PBS and 
permeabilized with 0.1% Triton X-100. The samples were blocked in 5% goat 
serum and then incubated with anti-Snaill antibody (Cell Signaling) in 5% goat 
serum for 2h at room temperature. Cells were washed with PBS and incubated 
with Alexa 594 anti-mouse antibody (Invitrogen) and DAPI (Invitrogen) for 2h 
at room temperature. Specimens were again washed with PBS and subsequently 
mounted with Aqua Poly/Mount (Polysciences). Image capture and quantifica- 
tion were completed as described below (see Immunofluorescence). 
Cap-binding assay. PC3 4EBP1™ cells were induced with doxycycline (1 jig ml‘, 
Sigma) for 48h, then collected and lysed in buffer A (10 mM Tris-HCl pH 7.6, 
150mM KCl, 4mM MgCl, 1mM DTT, 1mM EDTA, and protease inhibitors, 
supplemented with 1% NP-40). Cell lysates were incubated overnight at 4 °C with 
50 pl of the mRNA cap analogue m’GTP-sepharose (GE Healthcare) in buffer A. 
The beads were washed with buffer A supplemented with 0.5% NP-40. Protein 
complexes were dissociated using 1X sample buffer, and resolved by SDS-PAGE 
and western blotted with the appropriate antibodies. 

Pharmacological treatment of Pten’” mice and MRI imaging. Nine- and 
twelve-month-old Pten’" mice were gavaged daily with either vehicle (see mouse 
xenograft study), RADOO1 (10 mg kg” '; LC Laboratories), or INK128 (1 mgkg '; 
Intellikine) for the indicated times. Weight measurements were taken every 3 
days to monitor for toxicity. For the 28-day study, mice were imaged via MRI at 
day 0 and day 28 in a 14-T GE MR scanner (GE Healthcare). 

Prostate tissue processing. Whole mouse prostates were removed from wild- 
type and Pten'” mice, microdissected, and frozen in liquid nitrogen. Frozen 
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tissues were subsequently manually disassociated using a biopulverizer 
(Biospec) and additionally processed for protein and mRNA analysis as described 
above. 

Immunofluorescence. Prostates and lymph nodes were dissected from mice within 
2h of the indicated treatment and fixed in 10% formalin overnight at 4 °C. Tissues 
were subsequently dehydrated in ethanol (Sigma) at room temperature, mounted 
into paraffin blocks, and sectioned at 5 um. Specimens were de-paraffinized and 
rehydrated using CitriSolv (Fisher) followed by serial ethanol washes. Antigen 
unmasking was performed on each section using Citrate pH 6 (Vector Labs) in a 
pressure cooker at 125°C for 10-30 min. Sections were washed in distilled water 
followed by TBS washes. The sections were then incubated in 5% goat serum, 1% 
BSA in TBS for 1h at room temperature. Various primary antibodies were used 
including those specific for keratin 5 (Covance), cytokeratin 8 (Abcam and 
Covance), YB1 (Abcam), vimentin (Abcam), MTA1 (Cell signaling), CD44 (BD 
Pharmingen) and the androgen receptor (Epitomics), which were diluted 1:50- 
1:500 in blocking solution and incubated on sections overnight at 4°C. 
Specimens were then washed in TBS and incubated with the appropriate Alexa 
488 and 594 labelled secondary (Invitrogen) at 1:500 for 2h at room temperature 
with the exception of YB1 which was incubated with biotinylated anti-rabbit 
secondary (Vector) followed by incubation with Alexa 594 labelled Streptavidin 
(Invitrogen). A final set of washes in TBS was completed at room temperature 
followed by mounting with DAPI Hardset Mounting Medium (Vector Lab). A 
Zeiss Spinning Disc confocal (Zeiss, CSU-X1) was used to image the sections at 
40X-100X. Individual prostate cells were quantified for mean fluorescence 
intensity (m.f.i.) using the Axiovision (Zeiss, Release 4.8) densitometric tool. 
Lymph node metastasis measurements. Mouse lymph nodes were processed as 
described above and stained for CK8 and androgen receptor. Lymph nodes were 
imaged using a Zeiss AX10 microscope. Metastases were identified and areas were 
measured using the Axiovision (Zeiss, Release 4.8) measurement tool. 
Semi-quantitative RT-PCR. Whole prostates were removed from wild-type and 
Pten!/ mice, microdissected, dissociated into single-cell suspension, and stained 
for epithelial cell markers as previously described*’ using fluorescence- 
conjugated antibodies for CD49f, Sca-1, CD31, CD45 and Terll9 (BD 
Biosciences). Luminal epithelial cells were sorted as previously described” using 
a FACS Aria (BD Biosciences). Cell pellets were re-suspended in 500 yl TRIzol 
Reagent and RNA was isolated and transcribed into cDNA as described above. 
Semi-quantitative PCR analysis was performed using oligonucleotides for vimentin 
and B-actin at 200 nM in a 25 ul reaction with 12.5 pl GoTaq (Promega) for 32 and 
33 cycles respectively, which were within the linear range (Supplementary Fig. 25f). 
Immunohistochemistry. Immunohistochemistry was performed as described 
above (see immunofluorescence section) with the exception that immediately after 
antigen presentation and TBS washes, specimens were incubated in 3% hydrogen 
peroxide in TBS followed by TBS washes. The following primary antibodies were 
used: phospho-AKT™”? (Cell Signaling), phospho-rpS6"”“* (Cell Signaling), 
phospho-4EBP1"°”/*° (Cell Signaling), phospho-histone H3 (Upstate), and 
cleaved caspase 3 (Cell Signaling). This was followed by TBS washes and incuba- 
tion with the appropriate biotinylated secondary antibody (Vector Lab) for 30 min 
at room temperature. An ABC-HRP Kit (Vector Lab) was used to amplify the 
signal, followed by a brief incubation in hydrogen peroxide. The protein of interest 
was detected using DAB (Sigma). Specimens were counterstained with haematoxylin 
(Thermo Scientific), dehydrated with Citrisolv (Fisher), and mounted with 
Cytoseal XYL (Vector Lab). 

Haematoxylin and eosin staining. Paraffin-embedded prostate specimens were 
deparaffinized and rehydrated as described above (see immunofluorescence section), 
stained with haematoxylin (Thermo Scientific), and washed with water. This was 
followed by a brief incubation in differentiation RTU (VWR) and two washes with 
water followed by two 70% ethanol washes. The samples were then stained with eosin 
(Thermo Scientific) and dehydrated with ethanol followed by CitriSolv (Fisher). 
Slides were mounted with Cytoseal XYL (Richard Allan Scientific). 
Oligonucleotides. YB1 5’ UTR cloning and site-directed mutagenesis oligo- 
nucleotides are as follows. YB1 5’ UTR cloning: forward 5'-GCTACAAGCTTGG 
GCTTATCCCGCCT-3’, reverse 5’-TCGATCCATGGGGTTGCGGTGATGGT-3’; 
deletion (20-34): forward 5'-TGGGCTTATCCCGCCTGTCCTTCGATCGGTA 
GCGGGAGCG-3’, reverse 5’-CGCTCCCGCTACCGATCGAAGGACAGGCG 
GGATAAGCCCA-3’; transversion (20-34): forward 5’-TGGGCTTATCCCGC 
CTGTCCGCGGTAAGAGCGATCTTCGATCGGTAGCGGGAGCG-3’, reverse 
5'-CGCTCCCGCTACCGATCGAAGATCGCTCTTACCGCGGACAGGCGGG 
ATAAGCCCA-3’. 

Human qPCR oligonucleotides are as follows. B-actin forward 5'-GCAA 
AGACCTGTACGCCAAC-3’, reverse 5’-AGTACTTGCGCTCAGGAGGA-3’; 
CD44 forward 5’-CAACAACACAAATGGCTGGT-3’, reverse 5'-CTGAGGT 
GTCTGTCTCTTTCATCT-3’; vimentin forward 5’-GGCCCAGCTGTAAGT 
TGGTA-3’, reverse 5’-GGAGCGAGAGTGGCAGAG-3’;  Snaill forward 
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5'-CACTATGCCGCGCTCTTTC-3’, reverse 5’-GCTGGAAGGTAAACTCTG 
GATTAGA-3’; YBI forward 5’-TCGCCAAAGACAGCCTAGAGA-3’, reverse 
5'-TCTGCGTCGGTAATTGAAGTTG-3'; MTAI forward 5'’-CAAAGTGGTG 
TGCTTCTACCG-3’, reverse 5'’-CGGCCTTATAGCAGACTGACA-3'; PLAU 
forward 5'-TTGCTCACCACAACGACATT-3’, reverse 5’-GGCAGGCAGATG 
GTCTGTAT-3’; FGFBP1 forward 5'-ACTGGATCCGTGTGCTCAG-3’, reverse 
5'-GAGCAGGGTGAGGCTACAGA-3’; ARID5B forward 5'-TGGACTCAACT 
TCAAAGACGTTC-3’, reverse 5'-ACGTTCGTTTCTTCCTCGTC-3’; CTGF 
forward 5’-CTCCTGCAGGCTAGAGAAGC-3’, reverse 5’-GATGCACTTTT 
TGCCCTTCTT-3'; RND3 forward 5'-AAAAACTGCGCTGCTCCAT-3’, 
reverse 5’-TCAAAACTGGCCGTGTAATTC-3’; KLF6 forward 5’-AAAGCTC 
CCACTTGAAAGCA-3’, reverse 5’-CCTTCCCATGAGCATCTGTAA-3’; 
BCL6 forward 5’-TTCCGCTACAAGGGCAAC-3’, reverse 5’-TGCAACGATA 
GGGTTTCTCA-3'; FOXAI forward 5'-AGGGCTGGATGGTTGTATTG-3’, 
reverse 5'-ACCGGGACGGAGGAGTAG-3’; GDF15 forward 5'-CCGGATAC 
TCACGCCAGA-3’, reverse 5'-AGAGATACGCAGGTGCAGGT-3'; HBP1 
forward 5'-GCTGGTGGTGTTGTCGTG-3’, reverse 5'-CATGTTATGGTGCT 
CTGACTGC-3’; Twist1 forward 5’-CATCCTCACACCTCTGCATT-3’, reverse 
5'-TTCCTTTCAGTGGCTGATTG-3’; LEF1 forward 5'-CCTTGGTGAACGA 
GTCTGAAATC-3’, reverse 5'-GAGGTTTGTGCTTGTCTGGC-3’;  rpS19 
forward 5'-GCTGGCCAAACATAAAGAGC-3’, reverse 5'-CTGGGTCTGAC 
ACCGTTTCT-3’; 5S rRNA forward 5’-GCCCGATCTCGTCTGATCT-3’, 
reverse 5’-AGCCTACAGCACCCGGTATT-3'; firefly luciferase forward 
5'-AATCAAAGAGGCGAACTGTG-3’, reverse 5’-TTCGTCTTCGTCCCAGT 
AAG-3’, 

Mouse gPCR oligonucleotides are as follows. B-actin forward 5’-CTAAGG 
CCAACCGTGAAAAG-3’, reverse 5’-ACCAGAGGCATACAGGGACA-3’; 
Yb1 forward 5'-GGGTTACAGACCACGATTCC-3’, reverse 5'-GGCGATACC 
GACGTTGAG-3’; vimentin forward 5’-TCCAGCAGCTTCCTGTAGGT-3’, 


reverse 5’-CCCTCACCTGTGAAGTGGAT-3’; Cd44 forward 5'-ACAGTACCT 
TACCCACCATG-3’, reverse 5’-GGATGAATCCTCGGAATTAC-3’; Mtal 
forward 5'-AGTGCGCCTAATCCGTGGTG-3’, reverse 5'-CTGAGGATGAG 
AGCAGCTTTCG-3’. 

siRNA/shRNA sequences are as follows. Control (D-001810-01) 5'-UGGU 
UUACAUGUCGACUAA-3’; vimentin (L-003551) 5’-UCACGAUGACCUUG 
AAUAA-3', 5'-GGAAAUGGCUCGUCACCUU-3’, 5'’-GAGGGAAACUAAU 
CUGGAU-3’, 5'-UUAAGACGGUUGAAACUAG-3’; YBI (L-010213) 5’-CUG 
AGUAAAUGCCGGCUUA-3’, 5'-CGACGCAGACGCCCAGAAA-3’, 5'-GUA 
AGGAACGGAUAUGGUU-3’, 5'-GCGGAGGCAGCAAAUGUUA-3'; MTA1 
(L-004127) 5’-UCACGGACAUUCAGCAAGA-3’, 5'’-GGACCAAACCGCAG 
UAACA-3’, 5'-GCAUCUUGUUGGACAUAUU-3’, 5’-CCAGCAUCAUUGA 
GUACUA-3’; CD44 (L-009999) 5’-GAAUAUAACCUGCCGCUUU-3’, 5'-CA 
AGUGGACUCAACGGAGA-3’, 5'-CGAAGAAGGUGUGGGCAGA-3’, _5/- 
GAUCAACAGUGGCAAUGGA-3’; 4EBP1 (L-003005) 5’-CUGAUGGAGU 
GUCGGAACU-3’, 5’-CAUCUAUGACCGGAAAUUC-3’, 5’-GCAAUAGCCC 
AGAAGAUAA-3', 5'‘-GAGAUGGACAUUUAAAGCA-3’; 4EBP2 (L-018671) 
5’-GCAGCUACCUCAUGACUAU-3’, 5’-GGAGGAACUCGAAUCAUUU- 
3’, 5'-GCAAUUCUCCCAUGGCUCA-3’, 5’-UUGAACAACUUGAACAA 
UC-3’; rictor (LL-016984) 5’-GACACAAGCACUUCGAUUA-3’, 5'-GAAGAU 
UUAUUGAGUCCUA-3’, 5'-GCGAGCUGAUGUAGAAUUA-3’, 5’-GGGA 
AUACAACUCCAAAUA-3’; PTEN SH-003023-01-10 5’-GCTAAGAGAGGT 
TTCCGAA-3’, SH-003023-02-10 5'‘-AGACTGATGTGTATACGTA-3’. 
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Sorting out the sirtuins 


Debates over the role of sirtuin proteins in ageing are maturing into functional assessments of the individual proteins. 
It seems that overexpression of a specific sirtuin can extend lifespan in male mice. 


DAVID B. LOMBARD 
& RICHARD A. MILLER 


braham Lincoln once said 
A God must have loved 

the common people because 
he made so many of them. Nature 
must feel the same way about the 
sirtuins, a large family of proteins 
that achieved celebrity status when 
one member was found to increase 
lifespan in yeast’. But are the mam- 
malian sirtuins the rock stars of an 
ensemble of anti-ageing proteins, or 
merely members of the entourage? 
The original model, proposed in 
about 2005, that sirtuins have broadly 
evolutionarily conserved roles in 
promoting longevity per se is now 
being refined through more detailed 
functional investigations of each 
sirtuin’. Ina paper published on Nature's 
website today, Kanfi et al.’ follow this trend 
by reporting that overexpression ofa sirtuin 
called SIRT6 leads to a modest extension of 
lifespan in male, but not female, mice. 

Does an extension of lifespan imply an effect 
on ageing? Not necessarily: interventions 
unrelated to ageing, such as giving insulin toa 
person with type I diabetes, can increase mean 
and maximal lifespan. The lifespan extension 
observed by Kanfi and colleagues in SIRT6- 
overexpressing male mice could be explained, 
at least partially, by SIRT6 acting as a tumour 
suppressor. Because male mice have a higher 
incidence of spontaneous cancer than female 
mice (incidences of 81% and 50%, respectively, 
were observed in this study), an anticancer 
protein (perhaps SIRT6?) would have a larger 
effect on lifespan in males than in females. 

Proving that a lifespan-increasing interven- 
tion indeed acts by delaying ageing processes 
is not a simple matter. For example, accept- 
ance of the idea that lifespan extension by 
caloric restriction (a diet with reduced calorie 
intake) reflects a genuine deceleration of age- 
ing emerged gradually from evidence’ that 
restriction slows age-related changes in the 
properties of proliferative and non-prolif- 
erative cells in many tissues, and does so in 
multiple organ systems. Similar cases are 
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Figure 1 | Potential mechanisms of action of SIRT6 on 
longevity. Several reports 


7,10-12 


being constructed by researchers proposing 
that dwarf mice could act as models for slowed 
ageing’. 

Reports of lifespan increases in mutant 
or drug-treated mice, particularly studies in 
which the observed effects are modest, often 
prove difficult for other laboratories to repeat. 
This is presumably due to subtle but crucial 
variations in the animals’ diet or genetic back- 
ground, or in husbandry practices®. More- 
over, the preference for publication of positive 
over negative findings inevitably inserts a 
smattering of false positive results into the 
literature, and these can be identified only by 
attempts to replicate experiments. One strength 
of Kanfi and colleagues’ paper’ is that SIRT6 
overexpression increased male lifespan in each 
of two groups of mice, which were derived from 
two different founder animals. However, the 
test for maximal lifespan — usually taken as 
stronger evidence than an effect on median 
longevity alone — reached statistical signifi- 
cance in only one of the two mouse groups. If 
the longevity effect seen by the authors proves 
robust, determining whether SIRT6 over- 
expression does indeed slow ageing will still 
require follow-up studies analysing a wide 
range of age-sensitive endpoints. 

In their article, Kanfi and colleagues 
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have demonstrated effects of the 
sirtuin protein SIRT6 on the activities of the hormones insulin 
and IGF-1, as well as on inflammation and DNA repair. These 
effects, together with a possible delay in cancer progression, could 
contribute to the increased lifespan in SIRT6-overexpressing male 
mice reported by Kanfi and colleagues’. 


include some observations hinting at 
potential mechanisms by which SIRT6 
overexpression might affect the lifespan 
of male mice. Compared with their nor- 
mal counterparts, SIRT6-overexpressing 
males had modestly reduced serum 
levels of the hormone IGF-1, and the 
signalling activity of IGF-1 receptors 
was weaker in peri-gonadal fat tissue 
in males but not in females. Previous 
reports have found that SIRT6 attenu- 
ates intracellular signalling initiated 
by IGF-1 and insulin’. Furthermore, 
dramatic deficits in IGF-1 and/or growth 
hormone (GH, which stimulates IGF-1 
secretion) lead to slower ageing and 
increased lifespan in at least four varie- 
ties of mutant mouse®. And mutations 
in the gene encoding the GH receptor 
in humans are associated with strong 
protection against diabetes and cancer’. 
So, it is plausible that SIRT6 overexpres- 
sion in mice might work through blunting of 
the GH/IGF-1 pathway. Evidence’ that rat lon- 
gevity can be augmented by surgical removal 
of intra-abdominal — but not subcutaneous 
— fat has begun to focus attention on meta- 
bolic and hormonal effects on specific fat 
depots as potential levers for pharmacological 
control of ageing. 

It is noteworthy that the effects of SIRT6 
overexpression reported by Kanfi et al.’ are 
seen only in male mice. Previous results®, by 
contrast, indicate that mutations in compo- 
nents of the GH/IGF-1 pathway usually have 
greater effects on longevity in female mice. 
This apparent discrepancy might be explained 
by differences between the mice in terms of 
underlying disease proclivities, levels of sex- 
specific hormones, inter-animal conflict or 
fat-tissue biology, leading to gender-specific 
responses to mutations, drugs and nutritional 
interventions. Working out the basis for these 
sex-specific interactions should provide clues 
to the mechanisms involved in these anti- 
ageing manipulations, and perhaps even help 
to answer the vexing question of why women 
tend to live longer than men. 

SIRT6 has other roles that could foster 
longer lifespan (Fig. 1). It promotes chromo- 
somal stability by several mechanisms, and 
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above-normal SIRT6 expression increases 
the efficiency of DNA repair'®. SIRT6 also 
reduces the expression of genes regulated by 
the NF-«B and HIF-1a proteins, which have 
roles in inflammation, cancer and, poten- 
tially, longevity'''’. It will be of interest to 
assess these aspects of SIRT6’s function in 
mice overexpressing the protein, and to test 
more definitively whether they contribute to 
protection against cancer and promotion of 
longevity. 

The recent spate of activity in sirtuin 
research, now supplemented by the present 
work, supports the case for placing the sirtuins 
on the front line of ageing research, sitting 
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cheek by jowl with other promising contestants, 
such as the proteins TOR, FoxO, AMPK, NRF2 
and ATF4. To paraphrase Winston Churchill, 
the discoveries of Kanfi et al. do not by any 
means represent the end of sirtuin research, 
nor even the beginning of the end. But they are, 
perhaps, the end of the beginning. m 
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